2024-08-20T22:09:22.3436133Z Current runner version: '2.319.1' 2024-08-20T22:09:22.3442501Z Runner name: 'i-0b43e2cc0d7540218' 2024-08-20T22:09:22.3443271Z Runner group name: 'Default' 2024-08-20T22:09:22.3444110Z Machine name: 'ip-10-0-70-243' 2024-08-20T22:09:22.3460183Z Testing runner upgrade compatibility 2024-08-20T22:09:22.5116666Z ##[group]GITHUB_TOKEN Permissions 2024-08-20T22:09:22.5118766Z Actions: read 2024-08-20T22:09:22.5119503Z Attestations: read 2024-08-20T22:09:22.5119991Z Checks: read 2024-08-20T22:09:22.5120484Z Contents: read 2024-08-20T22:09:22.5121068Z Deployments: read 2024-08-20T22:09:22.5121552Z Discussions: read 2024-08-20T22:09:22.5122058Z Issues: read 2024-08-20T22:09:22.5122611Z Metadata: read 2024-08-20T22:09:22.5123072Z Packages: read 2024-08-20T22:09:22.5123569Z Pages: read 2024-08-20T22:09:22.5124125Z PullRequests: read 2024-08-20T22:09:22.5124637Z RepositoryProjects: read 2024-08-20T22:09:22.5125229Z SecurityEvents: read 2024-08-20T22:09:22.5125837Z Statuses: read 2024-08-20T22:09:22.5126476Z ##[endgroup] 2024-08-20T22:09:22.5129597Z Secret source: Actions 2024-08-20T22:09:22.5130302Z Prepare workflow directory 2024-08-20T22:09:22.6041260Z Prepare all required actions 2024-08-20T22:09:22.6205182Z Getting action download info 2024-08-20T22:09:22.8017399Z Download action repository 'pytorch/test-infra@main' (SHA:0c3a2634aaa2f638c8f640e743f03d696ce1191f) 2024-08-20T22:09:23.3181567Z Download action repository 'pytorch/pytorch@main' (SHA:1ae5d5bb62141d7c9b1b0b66c66a462b4e10b1f2) 2024-08-20T22:09:27.3096952Z Download action repository 'aws-actions/configure-aws-credentials@v3' (SHA:50ac8dd1e1b10d09dac7b8727528b91bed831ac0) 2024-08-20T22:09:27.4889343Z Download action repository 'seemethere/upload-artifact-s3@v5' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2024-08-20T22:09:27.7716906Z Getting action download info 2024-08-20T22:09:27.8660363Z Download action repository 'malfet/checkout@silent-checkout' (SHA:e07af140b3ccefc05679e3755b9db68f4ee4589c) 2024-08-20T22:09:28.0470544Z Getting action download info 2024-08-20T22:09:28.1563036Z Download action repository 'nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482' (SHA:3e91a01664abd3c5cd539100d10d33b9c5b68482) 2024-08-20T22:09:28.3031729Z Uses: pytorch/pytorch/.github/workflows/_linux-test.yml@refs/tags/ciflow/trunk/133712 (40ec5f6ddd9787aca0449b24128343ff4c4a88b3) 2024-08-20T22:09:28.3033799Z ##[group] Inputs 2024-08-20T22:09:28.3034264Z build-environment: linux-focal-cuda12.4-py3.10-gcc9-sm86 2024-08-20T22:09:28.3036722Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 2, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 3, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 4, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 5, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}]} 2024-08-20T22:09:28.3039744Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:09:28.3040801Z sync-tag: 2024-08-20T22:09:28.3041602Z timeout-minutes: 240 2024-08-20T22:09:28.3041948Z use-gha: 2024-08-20T22:09:28.3042229Z dashboard-tag: 2024-08-20T22:09:28.3042549Z s3-bucket: gha-artifacts 2024-08-20T22:09:28.3042904Z aws-role-to-assume: 2024-08-20T22:09:28.3043230Z ##[endgroup] 2024-08-20T22:09:28.3044154Z Complete job name: linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 1, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) 2024-08-20T22:09:28.3754615Z A job started hook has been configured by the self-hosted runner administrator 2024-08-20T22:09:28.3893889Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2024-08-20T22:09:28.3905432Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:09:28.3905945Z ##[endgroup] 2024-08-20T22:09:29.5492172Z Runner Type: amz2023.linux.g5.4xlarge.nvidia.gpu 2024-08-20T22:09:29.5493110Z Instance Type: g5.4xlarge 2024-08-20T22:09:29.5493811Z AMI Name: al2023-ami-2023.5.20240701.0-kernel-6.1-x86_64 2024-08-20T22:09:29.5494351Z AMI ID: ami-06c68f701d8090592 2024-08-20T22:09:35.3181090Z ##[group]Run pytorch/test-infra/.github/actions/setup-ssh@main 2024-08-20T22:09:35.3181670Z with: 2024-08-20T22:09:35.3182359Z github-secret: *** 2024-08-20T22:09:35.3183258Z instructions: All testing is done inside the container, to start an interactive session run: docker exec -it $(docker container ps --format '{{.ID}}') bash 2024-08-20T22:09:35.3184221Z activate-with-label: false 2024-08-20T22:09:35.3184569Z label: with-ssh 2024-08-20T22:09:35.3184885Z remove-existing-keys: true 2024-08-20T22:09:35.3185243Z fail-silently: true 2024-08-20T22:09:35.3185544Z env: 2024-08-20T22:09:35.3185812Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:09:35.3186147Z ##[endgroup] 2024-08-20T22:09:35.4066910Z Please see https://github.com/pytorch/pytorch/wiki/Debugging-using-with-ssh-for-Github-Actions for more info. 2024-08-20T22:09:35.4070654Z ciflow reference detected, attempting to extract PR number 2024-08-20T22:09:35.7106889Z Grabbing public ssh keys from https://github.com/pytorch-bot[bot].keys 2024-08-20T22:09:35.7786800Z No SSH keys found for user pytorch-bot[bot] 2024-08-20T22:09:35.7787711Z Grabbing public ssh keys from https://github.com/XuehaiPan.keys 2024-08-20T22:09:35.8545824Z ~/.ssh/authorized_keys file found on node, removing ~/.ssh and starting fresh 2024-08-20T22:09:35.8561178Z Public keys pulled and installed to /home/ec2-user/.ssh/authorized_keys 2024-08-20T22:09:35.8586366Z Login using: ssh ec2-user@ec2-34-239-150-24.compute-1.amazonaws.com 2024-08-20T22:09:35.8587781Z All testing is done inside the container, to start an interactive session run: 2024-08-20T22:09:35.8589061Z docker exec -it $(docker container ps --format '{{.ID}}') bash 2024-08-20T22:09:35.8717170Z ##[group]Run pytorch/pytorch/.github/actions/checkout-pytorch@main 2024-08-20T22:09:35.8717736Z with: 2024-08-20T22:09:35.8718025Z submodules: recursive 2024-08-20T22:09:35.8718368Z fetch-depth: 0 2024-08-20T22:09:35.8718661Z env: 2024-08-20T22:09:35.8718940Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:09:35.8719286Z ##[endgroup] 2024-08-20T22:09:35.8917407Z ##[group]Run retry () { 2024-08-20T22:09:35.8917783Z retry () { 2024-08-20T22:09:35.8918284Z  $* || (sleep 1 && $*) || (sleep 2 && $*) || (sleep 4 && $*) || (sleep 8 && $*) 2024-08-20T22:09:35.8918827Z } 2024-08-20T22:09:35.8919123Z echo "${GITHUB_WORKSPACE}" 2024-08-20T22:09:35.8919600Z if [ -z "${NO_SUDO}" ]; then 2024-08-20T22:09:35.8920070Z  retry sudo rm -rf "${GITHUB_WORKSPACE}" 2024-08-20T22:09:35.8920513Z else 2024-08-20T22:09:35.8920857Z  retry rm -rf "${GITHUB_WORKSPACE}" 2024-08-20T22:09:35.8921276Z fi 2024-08-20T22:09:35.8921610Z mkdir "${GITHUB_WORKSPACE}" 2024-08-20T22:09:35.8934919Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:09:35.8935430Z env: 2024-08-20T22:09:35.8935713Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:09:35.8936054Z NO_SUDO: 2024-08-20T22:09:35.8936333Z ##[endgroup] 2024-08-20T22:09:35.8968358Z /home/ec2-user/actions-runner/_work/pytorch/pytorch 2024-08-20T22:09:36.0552766Z ##[group]Run malfet/checkout@silent-checkout 2024-08-20T22:09:36.0553188Z with: 2024-08-20T22:09:36.0553513Z ref: 40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:09:36.0553949Z fetch-depth: 0 2024-08-20T22:09:36.0554251Z submodules: recursive 2024-08-20T22:09:36.0554583Z quiet-checkout: true 2024-08-20T22:09:36.0554925Z repository: pytorch/pytorch 2024-08-20T22:09:36.0555389Z token: *** 2024-08-20T22:09:36.0555682Z ssh-strict: true 2024-08-20T22:09:36.0556003Z persist-credentials: true 2024-08-20T22:09:36.0556347Z clean: true 2024-08-20T22:09:36.0556674Z sparse-checkout-cone-mode: true 2024-08-20T22:09:36.0557053Z lfs: false 2024-08-20T22:09:36.0557343Z set-safe-directory: true 2024-08-20T22:09:36.0557870Z env: 2024-08-20T22:09:36.0558140Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:09:36.0558469Z ##[endgroup] 2024-08-20T22:09:36.1520078Z Syncing repository: pytorch/pytorch 2024-08-20T22:09:36.1521551Z ##[group]Getting Git version info 2024-08-20T22:09:36.1522265Z Working directory is '/home/ec2-user/actions-runner/_work/pytorch/pytorch' 2024-08-20T22:09:36.1523098Z [command]/usr/bin/git version 2024-08-20T22:09:36.1523461Z git version 2.40.1 2024-08-20T22:09:36.1525922Z ##[endgroup] 2024-08-20T22:09:36.1539549Z Temporarily overriding HOME='/home/ec2-user/actions-runner/_work/_temp/d67f8969-9dfa-4f12-8f38-e6c4cf368b10' before making global git config changes 2024-08-20T22:09:36.1540760Z Adding repository directory to the temporary git global config as a safe directory 2024-08-20T22:09:36.1544550Z [command]/usr/bin/git config --global --add safe.directory /home/ec2-user/actions-runner/_work/pytorch/pytorch 2024-08-20T22:09:36.1594826Z Deleting the contents of '/home/ec2-user/actions-runner/_work/pytorch/pytorch' 2024-08-20T22:09:36.1598491Z ##[group]Initializing the repository 2024-08-20T22:09:36.1601395Z [command]/usr/bin/git init /home/ec2-user/actions-runner/_work/pytorch/pytorch 2024-08-20T22:09:36.1644307Z hint: Using 'master' as the name for the initial branch. This default branch name 2024-08-20T22:09:36.1645185Z hint: is subject to change. To configure the initial branch name to use in all 2024-08-20T22:09:36.1645970Z hint: of your new repositories, which will suppress this warning, call: 2024-08-20T22:09:36.1646542Z hint: 2024-08-20T22:09:36.1647279Z hint: git config --global init.defaultBranch 2024-08-20T22:09:36.1647743Z hint: 2024-08-20T22:09:36.1648250Z hint: Names commonly chosen instead of 'master' are 'main', 'trunk' and 2024-08-20T22:09:36.1649073Z hint: 'development'. The just-created branch can be renamed via this command: 2024-08-20T22:09:36.1649931Z hint: 2024-08-20T22:09:36.1650262Z hint: git branch -m 2024-08-20T22:09:36.1651176Z Initialized empty Git repository in /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/ 2024-08-20T22:09:36.1658094Z [command]/usr/bin/git remote add origin https://github.com/pytorch/pytorch 2024-08-20T22:09:36.1699663Z ##[endgroup] 2024-08-20T22:09:36.1700246Z ##[group]Disabling automatic garbage collection 2024-08-20T22:09:36.1702540Z [command]/usr/bin/git config --local gc.auto 0 2024-08-20T22:09:36.1742792Z ##[endgroup] 2024-08-20T22:09:36.1743306Z ##[group]Setting up auth 2024-08-20T22:09:36.1748834Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2024-08-20T22:09:36.1790435Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2024-08-20T22:09:36.2159750Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2024-08-20T22:09:36.2200548Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2024-08-20T22:09:36.2554611Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2024-08-20T22:09:36.2610868Z ##[endgroup] 2024-08-20T22:09:36.2611601Z ##[group]Fetching the repository 2024-08-20T22:09:36.2616915Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --progress --no-recurse-submodules --quiet origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2024-08-20T22:09:38.7913603Z remote: Enumerating objects: 1008452 2024-08-20T22:09:38.7914255Z remote: Enumerating objects: 1010229, done. 2024-08-20T22:09:38.7914993Z remote: Counting objects: 0% (1/1777) 2024-08-20T22:09:38.7915515Z remote: Counting objects: 1% (18/1777) 2024-08-20T22:09:38.7916114Z remote: Counting objects: 2% (36/1777) 2024-08-20T22:09:38.7916614Z remote: Counting objects: 3% (54/1777) 2024-08-20T22:09:38.7917332Z remote: Counting objects: 4% (72/1777) 2024-08-20T22:09:38.7917931Z remote: Counting objects: 5% (89/1777) 2024-08-20T22:09:38.7918416Z remote: Counting objects: 6% (107/1777) 2024-08-20T22:09:38.7919010Z remote: Counting objects: 7% (125/1777) 2024-08-20T22:09:38.7919595Z remote: Counting objects: 8% (143/1777) 2024-08-20T22:09:38.7920144Z remote: Counting objects: 9% (160/1777) 2024-08-20T22:09:38.7920677Z remote: Counting objects: 10% (178/1777) 2024-08-20T22:09:38.7921293Z remote: Counting objects: 11% (196/1777) 2024-08-20T22:09:38.7921841Z remote: Counting objects: 12% (214/1777) 2024-08-20T22:09:38.7922340Z remote: Counting objects: 13% (232/1777) 2024-08-20T22:09:38.7923003Z remote: Counting objects: 14% (249/1777) 2024-08-20T22:09:38.7923678Z remote: Counting objects: 15% (267/1777) 2024-08-20T22:09:38.7924323Z remote: Counting objects: 16% (285/1777) 2024-08-20T22:09:38.7924916Z remote: Counting objects: 17% (303/1777) 2024-08-20T22:09:38.7925502Z remote: Counting objects: 18% (320/1777) 2024-08-20T22:09:38.7926126Z remote: Counting objects: 19% (338/1777) 2024-08-20T22:09:38.7926703Z remote: Counting objects: 20% (356/1777) 2024-08-20T22:09:38.7927276Z remote: Counting objects: 21% (374/1777) 2024-08-20T22:09:38.7927875Z remote: Counting objects: 22% (391/1777) 2024-08-20T22:09:38.7928468Z remote: Counting objects: 23% (409/1777) 2024-08-20T22:09:38.7929009Z remote: Counting objects: 24% (427/1777) 2024-08-20T22:09:38.7929592Z remote: Counting objects: 25% (445/1777) 2024-08-20T22:09:38.7930200Z remote: Counting objects: 26% (463/1777) 2024-08-20T22:09:38.7930832Z remote: Counting objects: 27% (480/1777) 2024-08-20T22:09:38.7931667Z remote: Counting objects: 28% (498/1777) 2024-08-20T22:09:38.7932239Z remote: Counting objects: 29% (516/1777) 2024-08-20T22:09:38.7932890Z remote: Counting objects: 30% (534/1777) 2024-08-20T22:09:38.7933544Z remote: Counting objects: 31% (551/1777) 2024-08-20T22:09:38.7934191Z remote: Counting objects: 32% (569/1777) 2024-08-20T22:09:38.7934861Z remote: Counting objects: 33% (587/1777) 2024-08-20T22:09:38.7935350Z remote: Counting objects: 34% (605/1777) 2024-08-20T22:09:38.7935855Z remote: Counting objects: 35% (622/1777) 2024-08-20T22:09:38.7936511Z remote: Counting objects: 36% (640/1777) 2024-08-20T22:09:38.7937006Z remote: Counting objects: 37% (658/1777) 2024-08-20T22:09:38.7937496Z remote: Counting objects: 38% (676/1777) 2024-08-20T22:09:38.7937974Z remote: Counting objects: 39% (694/1777) 2024-08-20T22:09:38.7938461Z remote: Counting objects: 40% (711/1777) 2024-08-20T22:09:38.7938961Z remote: Counting objects: 41% (729/1777) 2024-08-20T22:09:38.7939440Z remote: Counting objects: 42% (747/1777) 2024-08-20T22:09:38.7939934Z remote: Counting objects: 43% (765/1777) 2024-08-20T22:09:38.7940422Z remote: Counting objects: 44% (782/1777) 2024-08-20T22:09:38.7940899Z remote: Counting objects: 45% (800/1777) 2024-08-20T22:09:38.7941382Z remote: Counting objects: 46% (818/1777) 2024-08-20T22:09:38.7941876Z remote: Counting objects: 47% (836/1777) 2024-08-20T22:09:38.7942355Z remote: Counting objects: 48% (853/1777) 2024-08-20T22:09:38.7942848Z remote: Counting objects: 49% (871/1777) 2024-08-20T22:09:38.7943333Z remote: Counting objects: 50% (889/1777) 2024-08-20T22:09:38.7943821Z remote: Counting objects: 51% (907/1777) 2024-08-20T22:09:38.7944301Z remote: Counting objects: 52% (925/1777) 2024-08-20T22:09:38.7944793Z remote: Counting objects: 53% (942/1777) 2024-08-20T22:09:38.7945284Z remote: Counting objects: 54% (960/1777) 2024-08-20T22:09:38.7945899Z remote: Counting objects: 55% (978/1777) 2024-08-20T22:09:38.7946393Z remote: Counting objects: 56% (996/1777) 2024-08-20T22:09:38.7946885Z remote: Counting objects: 57% (1013/1777) 2024-08-20T22:09:38.7947393Z remote: Counting objects: 58% (1031/1777) 2024-08-20T22:09:38.7947904Z remote: Counting objects: 59% (1049/1777) 2024-08-20T22:09:38.7948410Z remote: Counting objects: 60% (1067/1777) 2024-08-20T22:09:38.7948908Z remote: Counting objects: 61% (1084/1777) 2024-08-20T22:09:38.7949410Z remote: Counting objects: 62% (1102/1777) 2024-08-20T22:09:38.7949926Z remote: Counting objects: 63% (1120/1777) 2024-08-20T22:09:38.7950422Z remote: Counting objects: 64% (1138/1777) 2024-08-20T22:09:38.7950923Z remote: Counting objects: 65% (1156/1777) 2024-08-20T22:09:38.7951424Z remote: Counting objects: 66% (1173/1777) 2024-08-20T22:09:38.7951922Z remote: Counting objects: 67% (1191/1777) 2024-08-20T22:09:38.7952444Z remote: Counting objects: 68% (1209/1777) 2024-08-20T22:09:38.7952935Z remote: Counting objects: 69% (1227/1777) 2024-08-20T22:09:38.7953475Z remote: Counting objects: 70% (1244/1777) 2024-08-20T22:09:38.7953979Z remote: Counting objects: 71% (1262/1777) 2024-08-20T22:09:38.7954466Z remote: Counting objects: 72% (1280/1777) 2024-08-20T22:09:38.7954958Z remote: Counting objects: 73% (1298/1777) 2024-08-20T22:09:38.7955453Z remote: Counting objects: 74% (1315/1777) 2024-08-20T22:09:38.7955938Z remote: Counting objects: 75% (1333/1777) 2024-08-20T22:09:38.7956428Z remote: Counting objects: 76% (1351/1777) 2024-08-20T22:09:38.7956922Z remote: Counting objects: 77% (1369/1777) 2024-08-20T22:09:38.7957408Z remote: Counting objects: 78% (1387/1777) 2024-08-20T22:09:38.7957996Z remote: Counting objects: 79% (1404/1777) 2024-08-20T22:09:38.7958501Z remote: Counting objects: 80% (1422/1777) 2024-08-20T22:09:38.7959005Z remote: Counting objects: 81% (1440/1777) 2024-08-20T22:09:38.7959595Z remote: Counting objects: 82% (1458/1777) 2024-08-20T22:09:38.7960097Z remote: Counting objects: 83% (1475/1777) 2024-08-20T22:09:38.7960595Z remote: Counting objects: 84% (1493/1777) 2024-08-20T22:09:38.7961097Z remote: Counting objects: 85% (1511/1777) 2024-08-20T22:09:38.7961596Z remote: Counting objects: 86% (1529/1777) 2024-08-20T22:09:38.7962096Z remote: Counting objects: 87% (1546/1777) 2024-08-20T22:09:38.7962589Z remote: Counting objects: 88% (1564/1777) 2024-08-20T22:09:38.7963090Z remote: Counting objects: 89% (1582/1777) 2024-08-20T22:09:38.7963644Z remote: Counting objects: 90% (1600/1777) 2024-08-20T22:09:38.7964135Z remote: Counting objects: 91% (1618/1777) 2024-08-20T22:09:38.7964644Z remote: Counting objects: 92% (1635/1777) 2024-08-20T22:09:38.7965148Z remote: Counting objects: 93% (1653/1777) 2024-08-20T22:09:38.7965645Z remote: Counting objects: 94% (1671/1777) 2024-08-20T22:09:38.7966143Z remote: Counting objects: 95% (1689/1777) 2024-08-20T22:09:38.7966641Z remote: Counting objects: 96% (1706/1777) 2024-08-20T22:09:38.7967137Z remote: Counting objects: 97% (1724/1777) 2024-08-20T22:09:38.7967643Z remote: Counting objects: 98% (1742/1777) 2024-08-20T22:09:38.7968479Z remote: Counting objects: 99% (1760/1777) 2024-08-20T22:09:38.7968976Z remote: Counting objects: 100% (1777/1777) 2024-08-20T22:09:38.7969517Z remote: Counting objects: 100% (1777/1777), done. 2024-08-20T22:09:38.8067398Z remote: Compressing objects: 0% (1/846) 2024-08-20T22:09:38.8332366Z remote: Compressing objects: 1% (9/846) 2024-08-20T22:09:38.8641424Z remote: Compressing objects: 2% (17/846) 2024-08-20T22:09:38.8971945Z remote: Compressing objects: 3% (26/846) 2024-08-20T22:09:38.9665130Z remote: Compressing objects: 4% (34/846) 2024-08-20T22:09:39.0144657Z remote: Compressing objects: 5% (43/846) 2024-08-20T22:09:39.0644787Z remote: Compressing objects: 6% (51/846) 2024-08-20T22:09:39.1025331Z remote: Compressing objects: 7% (60/846) 2024-08-20T22:09:39.1448698Z remote: Compressing objects: 8% (68/846) 2024-08-20T22:09:39.1663504Z remote: Compressing objects: 9% (77/846) 2024-08-20T22:09:39.1923768Z remote: Compressing objects: 10% (85/846) 2024-08-20T22:09:39.2106864Z remote: Compressing objects: 11% (94/846) 2024-08-20T22:09:39.2291636Z remote: Compressing objects: 12% (102/846) 2024-08-20T22:09:39.2451824Z remote: Compressing objects: 13% (110/846) 2024-08-20T22:09:39.2563485Z remote: Compressing objects: 14% (119/846) 2024-08-20T22:09:39.2663443Z remote: Compressing objects: 15% (127/846) 2024-08-20T22:09:39.2734660Z remote: Compressing objects: 16% (136/846) 2024-08-20T22:09:39.2803952Z remote: Compressing objects: 17% (144/846) 2024-08-20T22:09:39.2845255Z remote: Compressing objects: 18% (153/846) 2024-08-20T22:09:39.2893717Z remote: Compressing objects: 19% (161/846) 2024-08-20T22:09:39.2924731Z remote: Compressing objects: 20% (170/846) 2024-08-20T22:09:39.2946668Z remote: Compressing objects: 21% (178/846) 2024-08-20T22:09:39.2948728Z remote: Compressing objects: 22% (187/846) 2024-08-20T22:09:39.2953438Z remote: Compressing objects: 23% (195/846) 2024-08-20T22:09:39.2961498Z remote: Compressing objects: 24% (204/846) 2024-08-20T22:09:39.2971585Z remote: Compressing objects: 25% (212/846) 2024-08-20T22:09:39.2973197Z remote: Compressing objects: 26% (220/846) 2024-08-20T22:09:39.2977216Z remote: Compressing objects: 27% (229/846) 2024-08-20T22:09:39.2992197Z remote: Compressing objects: 28% (237/846) 2024-08-20T22:09:39.3003146Z remote: Compressing objects: 29% (246/846) 2024-08-20T22:09:39.3012805Z remote: Compressing objects: 30% (254/846) 2024-08-20T22:09:39.3023158Z remote: Compressing objects: 31% (263/846) 2024-08-20T22:09:39.3027031Z remote: Compressing objects: 32% (271/846) 2024-08-20T22:09:39.3035984Z remote: Compressing objects: 33% (280/846) 2024-08-20T22:09:39.3042765Z remote: Compressing objects: 34% (288/846) 2024-08-20T22:09:39.3046598Z remote: Compressing objects: 35% (297/846) 2024-08-20T22:09:39.3053536Z remote: Compressing objects: 36% (305/846) 2024-08-20T22:09:39.3064085Z remote: Compressing objects: 37% (314/846) 2024-08-20T22:09:39.3067873Z remote: Compressing objects: 38% (322/846) 2024-08-20T22:09:39.3074534Z remote: Compressing objects: 39% (330/846) 2024-08-20T22:09:39.3076523Z remote: Compressing objects: 40% (339/846) 2024-08-20T22:09:39.3082552Z remote: Compressing objects: 41% (347/846) 2024-08-20T22:09:39.3093695Z remote: Compressing objects: 42% (356/846) 2024-08-20T22:09:39.3095541Z remote: Compressing objects: 43% (364/846) 2024-08-20T22:09:39.3102065Z remote: Compressing objects: 44% (373/846) 2024-08-20T22:09:39.3107320Z remote: Compressing objects: 45% (381/846) 2024-08-20T22:09:39.3111840Z remote: Compressing objects: 46% (390/846) 2024-08-20T22:09:39.3114625Z remote: Compressing objects: 47% (398/846) 2024-08-20T22:09:39.3117806Z remote: Compressing objects: 48% (407/846) 2024-08-20T22:09:39.3121146Z remote: Compressing objects: 49% (415/846) 2024-08-20T22:09:39.3123565Z remote: Compressing objects: 50% (423/846) 2024-08-20T22:09:39.3126713Z remote: Compressing objects: 51% (432/846) 2024-08-20T22:09:39.3128868Z remote: Compressing objects: 52% (440/846) 2024-08-20T22:09:39.3130428Z remote: Compressing objects: 53% (449/846) 2024-08-20T22:09:39.3132040Z remote: Compressing objects: 54% (457/846) 2024-08-20T22:09:39.3133221Z remote: Compressing objects: 55% (466/846) 2024-08-20T22:09:39.3134344Z remote: Compressing objects: 56% (474/846) 2024-08-20T22:09:39.3135066Z remote: Compressing objects: 57% (483/846) 2024-08-20T22:09:39.3137492Z remote: Compressing objects: 58% (491/846) 2024-08-20T22:09:39.3138251Z remote: Compressing objects: 59% (500/846) 2024-08-20T22:09:39.3138892Z remote: Compressing objects: 60% (508/846) 2024-08-20T22:09:39.3139417Z remote: Compressing objects: 61% (517/846) 2024-08-20T22:09:39.3139945Z remote: Compressing objects: 62% (525/846) 2024-08-20T22:09:39.3140465Z remote: Compressing objects: 63% (533/846) 2024-08-20T22:09:39.3143644Z remote: Compressing objects: 64% (542/846) 2024-08-20T22:09:39.3149944Z remote: Compressing objects: 65% (550/846) 2024-08-20T22:09:39.3155750Z remote: Compressing objects: 66% (559/846) 2024-08-20T22:09:39.3160502Z remote: Compressing objects: 67% (567/846) 2024-08-20T22:09:39.3164591Z remote: Compressing objects: 68% (576/846) 2024-08-20T22:09:39.3169093Z remote: Compressing objects: 69% (584/846) 2024-08-20T22:09:39.3172965Z remote: Compressing objects: 70% (593/846) 2024-08-20T22:09:39.3176303Z remote: Compressing objects: 71% (601/846) 2024-08-20T22:09:39.3178795Z remote: Compressing objects: 72% (610/846) 2024-08-20T22:09:39.3182896Z remote: Compressing objects: 73% (618/846) 2024-08-20T22:09:39.3185711Z remote: Compressing objects: 74% (627/846) 2024-08-20T22:09:39.3188308Z remote: Compressing objects: 75% (635/846) 2024-08-20T22:09:39.3190929Z remote: Compressing objects: 76% (643/846) 2024-08-20T22:09:39.3193533Z remote: Compressing objects: 77% (652/846) 2024-08-20T22:09:39.3195591Z remote: Compressing objects: 78% (660/846) 2024-08-20T22:09:39.3197154Z remote: Compressing objects: 79% (669/846) 2024-08-20T22:09:39.3199105Z remote: Compressing objects: 80% (677/846) 2024-08-20T22:09:39.3201460Z remote: Compressing objects: 81% (686/846) 2024-08-20T22:09:39.3203059Z remote: Compressing objects: 82% (694/846) 2024-08-20T22:09:39.3204341Z remote: Compressing objects: 83% (703/846) 2024-08-20T22:09:39.3206393Z remote: Compressing objects: 84% (711/846) 2024-08-20T22:09:39.3207868Z remote: Compressing objects: 85% (720/846) 2024-08-20T22:09:39.3209945Z remote: Compressing objects: 86% (728/846) 2024-08-20T22:09:39.3210924Z remote: Compressing objects: 87% (737/846) 2024-08-20T22:09:39.3212988Z remote: Compressing objects: 88% (745/846) 2024-08-20T22:09:39.3215636Z remote: Compressing objects: 89% (753/846) 2024-08-20T22:09:39.3216983Z remote: Compressing objects: 90% (762/846) 2024-08-20T22:09:39.3218026Z remote: Compressing objects: 91% (770/846) 2024-08-20T22:09:39.3218657Z remote: Compressing objects: 92% (779/846) 2024-08-20T22:09:39.3219765Z remote: Compressing objects: 93% (787/846) 2024-08-20T22:09:39.3220282Z remote: Compressing objects: 94% (796/846) 2024-08-20T22:09:39.3222522Z remote: Compressing objects: 95% (804/846) 2024-08-20T22:09:39.3223442Z remote: Compressing objects: 96% (813/846) 2024-08-20T22:09:39.3224104Z remote: Compressing objects: 97% (821/846) 2024-08-20T22:09:39.3225349Z remote: Compressing objects: 98% (830/846) 2024-08-20T22:09:39.3226281Z remote: Compressing objects: 99% (838/846) 2024-08-20T22:09:39.3226813Z remote: Compressing objects: 100% (846/846) 2024-08-20T22:09:39.3227364Z remote: Compressing objects: 100% (846/846), done. 2024-08-20T22:10:02.7614561Z remote: Total 1010229 (delta 1169), reused 1411 (delta 928), pack-reused 1008452 (from 1) 2024-08-20T22:10:23.9072570Z [command]/usr/bin/git rev-parse --verify --quiet 40ec5f6ddd9787aca0449b24128343ff4c4a88b3^{object} 2024-08-20T22:10:23.9109805Z 40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:10:23.9115013Z ##[endgroup] 2024-08-20T22:10:23.9115705Z ##[group]Determining the checkout info 2024-08-20T22:10:23.9117130Z ##[endgroup] 2024-08-20T22:10:23.9117870Z ##[group]Checking out the ref 2024-08-20T22:10:23.9119185Z [command]/usr/bin/git checkout --quiet --force 40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:10:25.6129203Z ##[endgroup] 2024-08-20T22:10:25.6129813Z ##[group]Setting up auth for fetching submodules 2024-08-20T22:10:25.6133912Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2024-08-20T22:10:25.6204769Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2024-08-20T22:10:25.6241817Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2024-08-20T22:10:25.6280926Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2024-08-20T22:10:25.6318225Z ##[endgroup] 2024-08-20T22:10:25.6319251Z ##[group]Fetching submodules 2024-08-20T22:10:25.6320972Z [command]/usr/bin/git submodule sync --recursive 2024-08-20T22:10:25.6695668Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 2024-08-20T22:10:25.7058038Z Submodule 'android/libs/fbjni' (https://github.com/facebookincubator/fbjni.git) registered for path 'android/libs/fbjni' 2024-08-20T22:10:25.7060790Z Submodule 'third_party/NNPACK_deps/FP16' (https://github.com/Maratyszcza/FP16.git) registered for path 'third_party/FP16' 2024-08-20T22:10:25.7063481Z Submodule 'third_party/NNPACK_deps/FXdiv' (https://github.com/Maratyszcza/FXdiv.git) registered for path 'third_party/FXdiv' 2024-08-20T22:10:25.7064991Z Submodule 'third_party/NNPACK' (https://github.com/Maratyszcza/NNPACK.git) registered for path 'third_party/NNPACK' 2024-08-20T22:10:25.7069550Z Submodule 'third_party/VulkanMemoryAllocator' (https://github.com/GPUOpen-LibrariesAndSDKs/VulkanMemoryAllocator.git) registered for path 'third_party/VulkanMemoryAllocator' 2024-08-20T22:10:25.7072591Z Submodule 'third_party/XNNPACK' (https://github.com/google/XNNPACK.git) registered for path 'third_party/XNNPACK' 2024-08-20T22:10:25.7076144Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/benchmark' 2024-08-20T22:10:25.7079732Z Submodule 'third_party/cpp-httplib' (https://github.com/yhirose/cpp-httplib.git) registered for path 'third_party/cpp-httplib' 2024-08-20T22:10:25.7083500Z Submodule 'third_party/cpuinfo' (https://github.com/pytorch/cpuinfo.git) registered for path 'third_party/cpuinfo' 2024-08-20T22:10:25.7087372Z Submodule 'third_party/cudnn_frontend' (https://github.com/NVIDIA/cudnn-frontend.git) registered for path 'third_party/cudnn_frontend' 2024-08-20T22:10:25.7091117Z Submodule 'third_party/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/cutlass' 2024-08-20T22:10:25.7094955Z Submodule 'third_party/eigen' (https://gitlab.com/libeigen/eigen.git) registered for path 'third_party/eigen' 2024-08-20T22:10:25.7098840Z Submodule 'third_party/fbgemm' (https://github.com/pytorch/fbgemm) registered for path 'third_party/fbgemm' 2024-08-20T22:10:25.7102940Z Submodule 'third_party/flatbuffers' (https://github.com/google/flatbuffers.git) registered for path 'third_party/flatbuffers' 2024-08-20T22:10:25.7106881Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/fmt' 2024-08-20T22:10:25.7112278Z Submodule 'third_party/gemmlowp/gemmlowp' (https://github.com/google/gemmlowp.git) registered for path 'third_party/gemmlowp/gemmlowp' 2024-08-20T22:10:25.7120009Z Submodule 'third_party/gloo' (https://github.com/facebookincubator/gloo) registered for path 'third_party/gloo' 2024-08-20T22:10:25.7124504Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/googletest' 2024-08-20T22:10:25.7128671Z Submodule 'third_party/ideep' (https://github.com/intel/ideep) registered for path 'third_party/ideep' 2024-08-20T22:10:25.7133250Z Submodule 'third_party/ittapi' (https://github.com/intel/ittapi.git) registered for path 'third_party/ittapi' 2024-08-20T22:10:25.7137716Z Submodule 'third_party/kineto' (https://github.com/pytorch/kineto) registered for path 'third_party/kineto' 2024-08-20T22:10:25.7142314Z Submodule 'third_party/mimalloc' (https://github.com/microsoft/mimalloc.git) registered for path 'third_party/mimalloc' 2024-08-20T22:10:25.7146847Z Submodule 'third_party/nccl/nccl' (https://github.com/NVIDIA/nccl) registered for path 'third_party/nccl/nccl' 2024-08-20T22:10:25.7151586Z Submodule 'third_party/nlohmann' (https://github.com/nlohmann/json.git) registered for path 'third_party/nlohmann' 2024-08-20T22:10:25.7156313Z Submodule 'third_party/onnx' (https://github.com/onnx/onnx.git) registered for path 'third_party/onnx' 2024-08-20T22:10:25.7161726Z Submodule 'third_party/opentelemetry-cpp' (https://github.com/open-telemetry/opentelemetry-cpp.git) registered for path 'third_party/opentelemetry-cpp' 2024-08-20T22:10:25.7166612Z Submodule 'third_party/pocketfft' (https://github.com/mreineck/pocketfft) registered for path 'third_party/pocketfft' 2024-08-20T22:10:25.7172186Z Submodule 'third_party/protobuf' (https://github.com/protocolbuffers/protobuf.git) registered for path 'third_party/protobuf' 2024-08-20T22:10:25.7177029Z Submodule 'third_party/NNPACK_deps/psimd' (https://github.com/Maratyszcza/psimd.git) registered for path 'third_party/psimd' 2024-08-20T22:10:25.7182235Z Submodule 'third_party/NNPACK_deps/pthreadpool' (https://github.com/Maratyszcza/pthreadpool.git) registered for path 'third_party/pthreadpool' 2024-08-20T22:10:25.7187361Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/pybind11' 2024-08-20T22:10:25.7192778Z Submodule 'third_party/python-peachpy' (https://github.com/malfet/PeachPy.git) registered for path 'third_party/python-peachpy' 2024-08-20T22:10:25.7197880Z Submodule 'third_party/sleef' (https://github.com/shibatch/sleef) registered for path 'third_party/sleef' 2024-08-20T22:10:25.7206904Z Submodule 'third_party/tensorpipe' (https://github.com/pytorch/tensorpipe.git) registered for path 'third_party/tensorpipe' 2024-08-20T22:10:25.7241566Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/android/libs/fbjni'... 2024-08-20T22:10:26.0229644Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/FP16'... 2024-08-20T22:10:26.2282369Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/FXdiv'... 2024-08-20T22:10:26.3941961Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/NNPACK'... 2024-08-20T22:10:26.6414048Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/VulkanMemoryAllocator'... 2024-08-20T22:10:28.5336930Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/XNNPACK'... 2024-08-20T22:10:39.6872758Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/benchmark'... 2024-08-20T22:10:40.1083051Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cpp-httplib'... 2024-08-20T22:10:40.5177901Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cpuinfo'... 2024-08-20T22:10:41.1217849Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cudnn_frontend'... 2024-08-20T22:10:42.3428536Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cutlass'... 2024-08-20T22:10:44.4649668Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/eigen'... 2024-08-20T22:10:50.4589320Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm'... 2024-08-20T22:10:51.9006183Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/flatbuffers'... 2024-08-20T22:10:53.5854599Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fmt'... 2024-08-20T22:10:54.8275426Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/gemmlowp/gemmlowp'... 2024-08-20T22:10:55.2646293Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/gloo'... 2024-08-20T22:10:55.5994783Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/googletest'... 2024-08-20T22:10:56.7605672Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/ideep'... 2024-08-20T22:10:57.1251743Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/ittapi'... 2024-08-20T22:10:57.3842935Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto'... 2024-08-20T22:10:59.4416077Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/mimalloc'... 2024-08-20T22:11:00.1415837Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/nccl/nccl'... 2024-08-20T22:11:00.8342168Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/nlohmann'... 2024-08-20T22:11:07.0089058Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/onnx'... 2024-08-20T22:11:08.9767394Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp'... 2024-08-20T22:11:13.2114228Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/pocketfft'... 2024-08-20T22:11:13.4336571Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/protobuf'... 2024-08-20T22:11:22.2265447Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/psimd'... 2024-08-20T22:11:22.4004625Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/pthreadpool'... 2024-08-20T22:11:22.5913698Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/pybind11'... 2024-08-20T22:11:23.5016653Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/python-peachpy'... 2024-08-20T22:11:23.7900917Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/sleef'... 2024-08-20T22:11:24.4731802Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe'... 2024-08-20T22:11:24.9448246Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2024-08-20T22:11:24.9619437Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2024-08-20T22:11:24.9737229Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2024-08-20T22:11:25.0050358Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2024-08-20T22:11:25.0497099Z Submodule path 'third_party/VulkanMemoryAllocator': checked out 'a6bfc237255a6bac1513f7c1ebde6d8aed6b5191' 2024-08-20T22:11:26.2137158Z Submodule path 'third_party/XNNPACK': checked out 'fcbf55af6cf28a4627bcd1f703ab7ad843f0f3a2' 2024-08-20T22:11:26.2420805Z Submodule path 'third_party/benchmark': checked out '0d98dba29d66e93259db7daa53a9327df767a415' 2024-08-20T22:11:26.2934422Z Submodule path 'third_party/cpp-httplib': checked out '3b6597bba913d51161383657829b7e644e59c006' 2024-08-20T22:11:26.4038772Z Submodule path 'third_party/cpuinfo': checked out '3c8b1533ac03dd6531ab6e7b9245d488f13a82a5' 2024-08-20T22:11:26.4430978Z Submodule path 'third_party/cudnn_frontend': checked out '23511ba176243f27b3b275da1fb3814ea805a171' 2024-08-20T22:11:27.0609466Z Submodule path 'third_party/cutlass': checked out 'bbe579a9e3beb6ea6626d9227ec32d0dae119a49' 2024-08-20T22:11:27.3411441Z Submodule path 'third_party/eigen': checked out '3147391d946bb4b6c68edd901f2add6ac1f31f8c' 2024-08-20T22:11:27.4331350Z Submodule path 'third_party/fbgemm': checked out 'dbc3157bf256f1339b3fa1fef2be89ac4078be0e' 2024-08-20T22:11:27.4354298Z Submodule 'third_party/asmjit' (https://github.com/asmjit/asmjit.git) registered for path 'third_party/fbgemm/third_party/asmjit' 2024-08-20T22:11:27.4358133Z Submodule 'third_party/cpuinfo' (https://github.com/pytorch/cpuinfo) registered for path 'third_party/fbgemm/third_party/cpuinfo' 2024-08-20T22:11:27.4361671Z Submodule 'third_party/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/fbgemm/third_party/cutlass' 2024-08-20T22:11:27.4365094Z Submodule 'third_party/googletest' (https://github.com/google/googletest) registered for path 'third_party/fbgemm/third_party/googletest' 2024-08-20T22:11:27.4369013Z Submodule 'third_party/hipify_torch' (https://github.com/ROCmSoftwarePlatform/hipify_torch.git) registered for path 'third_party/fbgemm/third_party/hipify_torch' 2024-08-20T22:11:27.4400922Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/asmjit'... 2024-08-20T22:11:28.4987315Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/cpuinfo'... 2024-08-20T22:11:29.1100933Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/cutlass'... 2024-08-20T22:11:31.3161584Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/googletest'... 2024-08-20T22:11:32.3671208Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/hipify_torch'... 2024-08-20T22:11:32.6934276Z Submodule path 'third_party/fbgemm/third_party/asmjit': checked out 'd3fbf7c9bc7c1d1365a94a45614b91c5a3706b81' 2024-08-20T22:11:32.8043431Z Submodule path 'third_party/fbgemm/third_party/cpuinfo': checked out 'ed8b86a253800bafdb7b25c5c399f91bff9cb1f3' 2024-08-20T22:11:33.3098901Z Submodule path 'third_party/fbgemm/third_party/cutlass': checked out 'fc9ebc645b63f3a6bc80aaefde5c063fb72110d6' 2024-08-20T22:11:33.3803780Z Submodule path 'third_party/fbgemm/third_party/googletest': checked out 'cbf019de22c8dd37b2108da35b2748fd702d1796' 2024-08-20T22:11:33.3953937Z Submodule path 'third_party/fbgemm/third_party/hipify_torch': checked out '23f53b025b466d8ec3c45d52290d3442f7fbe6b1' 2024-08-20T22:11:33.5455132Z Submodule path 'third_party/flatbuffers': checked out '01834de25e4bf3975a9a00e816292b1ad0fe184b' 2024-08-20T22:11:33.5916029Z Submodule path 'third_party/fmt': checked out '0c9fce2ffefecfdce794e1859584e25877b7b592' 2024-08-20T22:11:33.6364546Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2024-08-20T22:11:33.6685430Z Submodule path 'third_party/gloo': checked out '5354032ea08eadd7fc4456477f7f7c6308818509' 2024-08-20T22:11:33.7202389Z Submodule path 'third_party/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2024-08-20T22:11:33.7359396Z Submodule path 'third_party/ideep': checked out '55ca0191687aaf19aca5cdb7881c791e3bea442b' 2024-08-20T22:11:33.7381603Z Submodule 'mkl-dnn' (https://github.com/intel/mkl-dnn.git) registered for path 'third_party/ideep/mkl-dnn' 2024-08-20T22:11:33.7411780Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/ideep/mkl-dnn'... 2024-08-20T22:11:46.8810647Z Submodule path 'third_party/ideep/mkl-dnn': checked out '1137e04ec0b5251ca2b4400a4fd3c667ce843d67' 2024-08-20T22:11:46.9026674Z Submodule path 'third_party/ittapi': checked out '5b8a7d7422611c3a0d799fb5fc5dd4abfae35b42' 2024-08-20T22:11:47.0018889Z Submodule path 'third_party/kineto': checked out 'd9753139d181b9ff42872465aac0e5d3018be415' 2024-08-20T22:11:47.0042767Z Submodule 'libkineto/third_party/dynolog' (https://github.com/facebookincubator/dynolog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog' 2024-08-20T22:11:47.0045905Z Submodule 'libkineto/third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/fmt' 2024-08-20T22:11:47.0049428Z Submodule 'libkineto/third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/googletest' 2024-08-20T22:11:47.0080893Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog'... 2024-08-20T22:11:47.5568585Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/fmt'... 2024-08-20T22:11:48.8026752Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/googletest'... 2024-08-20T22:11:49.9563743Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out '7d04a0053a845370ae06ce317a22a48e9edcc74e' 2024-08-20T22:11:49.9584717Z Submodule 'third_party/DCGM' (https://github.com/NVIDIA/DCGM.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-08-20T22:11:49.9588054Z Submodule 'third_party/cpr' (https://github.com/libcpr/cpr.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-08-20T22:11:49.9592085Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-08-20T22:11:49.9595720Z Submodule 'third_party/gflags' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-08-20T22:11:49.9599428Z Submodule 'third_party/glog' (https://github.com/google/glog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-08-20T22:11:49.9603265Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-08-20T22:11:49.9606906Z Submodule 'third_party/json' (https://github.com/nlohmann/json.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-08-20T22:11:49.9610661Z Submodule 'third_party/pfs' (https://github.com/dtrugman/pfs.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-08-20T22:11:49.9642471Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM'... 2024-08-20T22:11:50.7881310Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/cpr'... 2024-08-20T22:11:51.1750051Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/fmt'... 2024-08-20T22:11:52.3789850Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags'... 2024-08-20T22:11:52.6845851Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/glog'... 2024-08-20T22:11:53.2621880Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/googletest'... 2024-08-20T22:11:54.3341483Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/json'... 2024-08-20T22:12:00.7584656Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/pfs'... 2024-08-20T22:12:01.3038378Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2024-08-20T22:12:01.3275962Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2024-08-20T22:12:01.3701761Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2024-08-20T22:12:01.3871596Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2024-08-20T22:12:01.3891289Z Submodule 'doc' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-08-20T22:12:01.3922202Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc'... 2024-08-20T22:12:01.7918741Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2024-08-20T22:12:01.8147907Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2024-08-20T22:12:01.8625175Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '58d77fa8070e8cec2dc1ed015d66b454c8d78850' 2024-08-20T22:12:01.9853946Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2024-08-20T22:12:02.0055106Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2024-08-20T22:12:02.0509156Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out '0041a40c1350ba702d475b9c4ad62da77caea164' 2024-08-20T22:12:02.1160031Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '7aca84427f224eeed3144123d5230d5871e93347' 2024-08-20T22:12:02.1623364Z Submodule path 'third_party/mimalloc': checked out 'b66e3214d8a104669c2ec05ae91ebc26a8f5ab78' 2024-08-20T22:12:02.1929232Z Submodule path 'third_party/nccl/nccl': checked out 'ab2b89c4c339bd7f816fbc114a4b05d386b66290' 2024-08-20T22:12:02.3182362Z Submodule path 'third_party/nlohmann': checked out '87cda1d6646592ac5866dc703c8e1839046a6806' 2024-08-20T22:12:02.8311489Z Submodule path 'third_party/onnx': checked out '3bf92c03a9f27eba3bda1e5b9e63ea20ec213557' 2024-08-20T22:12:02.8349608Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/onnx/third_party/benchmark' 2024-08-20T22:12:02.8352194Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/onnx/third_party/pybind11' 2024-08-20T22:12:02.8384643Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/onnx/third_party/benchmark'... 2024-08-20T22:12:03.4881713Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/onnx/third_party/pybind11'... 2024-08-20T22:12:04.5569833Z Submodule path 'third_party/onnx/third_party/benchmark': checked out '2dd015dfef425c866d9a43f2c67d8b52d709acb6' 2024-08-20T22:12:04.5969813Z Submodule path 'third_party/onnx/third_party/pybind11': checked out '5b0a6fc2017fcc176545afe3e09c9f9885283242' 2024-08-20T22:12:04.6873160Z Submodule path 'third_party/opentelemetry-cpp': checked out 'a799f4aed9c94b765dcdaabaeab7d5e7e2310878' 2024-08-20T22:12:04.6897010Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark) registered for path 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-08-20T22:12:04.6900528Z Submodule 'third_party/googletest' (https://github.com/google/googletest) registered for path 'third_party/opentelemetry-cpp/third_party/googletest' 2024-08-20T22:12:04.6904239Z Submodule 'third_party/ms-gsl' (https://github.com/microsoft/GSL) registered for path 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-08-20T22:12:04.6908342Z Submodule 'third_party/nlohmann-json' (https://github.com/nlohmann/json) registered for path 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-08-20T22:12:04.6912401Z Submodule 'third_party/opentelemetry-proto' (https://github.com/open-telemetry/opentelemetry-proto) registered for path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-08-20T22:12:04.6916364Z Submodule 'third_party/opentracing-cpp' (https://github.com/opentracing/opentracing-cpp.git) registered for path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-08-20T22:12:04.6920425Z Submodule 'third_party/prometheus-cpp' (https://github.com/jupp0r/prometheus-cpp) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-08-20T22:12:04.6924375Z Submodule 'tools/vcpkg' (https://github.com/Microsoft/vcpkg) registered for path 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-08-20T22:12:04.6956429Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/benchmark'... 2024-08-20T22:12:05.1497337Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/googletest'... 2024-08-20T22:12:06.1482457Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/ms-gsl'... 2024-08-20T22:12:06.4655073Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/nlohmann-json'... 2024-08-20T22:12:12.7121549Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/opentelemetry-proto'... 2024-08-20T22:12:12.9920429Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/opentracing-cpp'... 2024-08-20T22:12:13.1919010Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp'... 2024-08-20T22:12:13.4824464Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/tools/vcpkg'... 2024-08-20T22:12:20.2369878Z Submodule path 'third_party/opentelemetry-cpp/third_party/benchmark': checked out 'd572f4777349d43653b21d6c2fc63020ab326db2' 2024-08-20T22:12:20.2842741Z Submodule path 'third_party/opentelemetry-cpp/third_party/googletest': checked out 'b796f7d44681514f58a683a3a71ff17c94edb0c1' 2024-08-20T22:12:20.3030400Z Submodule path 'third_party/opentelemetry-cpp/third_party/ms-gsl': checked out '6f4529395c5b7c2d661812257cd6780c67e54afa' 2024-08-20T22:12:20.4262625Z Submodule path 'third_party/opentelemetry-cpp/third_party/nlohmann-json': checked out 'bc889afb4c5bf1c0d8ee29ef35eaaf4c8bef8a5d' 2024-08-20T22:12:20.4425940Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto': checked out '4ca4f0335c63cda7ab31ea7ed70d6553aee14dce' 2024-08-20T22:12:20.4612017Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp': checked out '06b57f48ded1fa3bdd3d4346f6ef29e40e08eaf5' 2024-08-20T22:12:20.4809440Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp': checked out 'c9ffcdda9086ffd9e1283ea7a0276d831f3c8a8d' 2024-08-20T22:12:20.4828244Z Submodule 'civetweb' (https://github.com/civetweb/civetweb.git) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-08-20T22:12:20.4831702Z Submodule 'googletest' (https://github.com/google/googletest.git) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-08-20T22:12:20.4864804Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb'... 2024-08-20T22:12:22.3683997Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest'... 2024-08-20T22:12:23.7134870Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'eefb26f82b233268fc98577d265352720d477ba4' 2024-08-20T22:12:23.7675694Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2024-08-20T22:12:24.4368261Z Submodule path 'third_party/opentelemetry-cpp/tools/vcpkg': checked out '8eb57355a4ffb410a2e94c07b4dca2dffbee8e50' 2024-08-20T22:12:24.4514275Z Submodule path 'third_party/pocketfft': checked out '9d3ab05a7fffbc71a492bc6a17be034e83e8f0fe' 2024-08-20T22:12:24.7592671Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2024-08-20T22:12:24.7617272Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/protobuf/third_party/benchmark' 2024-08-20T22:12:24.7620494Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/protobuf/third_party/googletest' 2024-08-20T22:12:24.7651318Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/benchmark'... 2024-08-20T22:12:25.1768950Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/googletest'... 2024-08-20T22:12:26.2437085Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2024-08-20T22:12:26.3250012Z Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2024-08-20T22:12:26.3366996Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2024-08-20T22:12:26.3522114Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2024-08-20T22:12:26.3972297Z Submodule path 'third_party/pybind11': checked out '941f45bcb51457884fa1afd6e24a67377d70f75c' 2024-08-20T22:12:26.4311470Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2024-08-20T22:12:26.4800430Z Submodule path 'third_party/sleef': checked out '60e76d2bce17d278b439d9da17177c8f957a9e9b' 2024-08-20T22:12:26.5134850Z Submodule path 'third_party/tensorpipe': checked out '52791a2fd214b2a9dc5759d36725909c1daa7f2e' 2024-08-20T22:12:26.5155969Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/tensorpipe/third_party/googletest' 2024-08-20T22:12:26.5159041Z Submodule 'third_party/libnop' (https://github.com/google/libnop.git) registered for path 'third_party/tensorpipe/third_party/libnop' 2024-08-20T22:12:26.5162836Z Submodule 'third_party/libuv' (https://github.com/libuv/libuv.git) registered for path 'third_party/tensorpipe/third_party/libuv' 2024-08-20T22:12:26.5166200Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/tensorpipe/third_party/pybind11' 2024-08-20T22:12:26.5198147Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/googletest'... 2024-08-20T22:12:27.6052678Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libnop'... 2024-08-20T22:12:27.8320491Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libuv'... 2024-08-20T22:12:29.0467595Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11'... 2024-08-20T22:12:30.0148255Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2024-08-20T22:12:30.0344155Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2024-08-20T22:12:30.1130698Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '1dff88e5161cba5c59276d2070d2e304e4dcb242' 2024-08-20T22:12:30.1474280Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2024-08-20T22:12:30.1493037Z Submodule 'tools/clang' (https://github.com/wjakob/clang-cindex-python3) registered for path 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-08-20T22:12:30.1523825Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11/tools/clang'... 2024-08-20T22:12:30.3670274Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2024-08-20T22:12:30.3717652Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2024-08-20T22:12:30.4082907Z Entering 'android/libs/fbjni' 2024-08-20T22:12:30.4134789Z Entering 'third_party/FP16' 2024-08-20T22:12:30.4187673Z Entering 'third_party/FXdiv' 2024-08-20T22:12:30.4243961Z Entering 'third_party/NNPACK' 2024-08-20T22:12:30.4297035Z Entering 'third_party/VulkanMemoryAllocator' 2024-08-20T22:12:30.4349421Z Entering 'third_party/XNNPACK' 2024-08-20T22:12:30.4420069Z Entering 'third_party/benchmark' 2024-08-20T22:12:30.4476980Z Entering 'third_party/cpp-httplib' 2024-08-20T22:12:30.4528518Z Entering 'third_party/cpuinfo' 2024-08-20T22:12:30.4581132Z Entering 'third_party/cudnn_frontend' 2024-08-20T22:12:30.4633189Z Entering 'third_party/cutlass' 2024-08-20T22:12:30.4693655Z Entering 'third_party/eigen' 2024-08-20T22:12:30.4747926Z Entering 'third_party/fbgemm' 2024-08-20T22:12:30.4802086Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-08-20T22:12:30.4853029Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-08-20T22:12:30.4904017Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-08-20T22:12:30.4961439Z Entering 'third_party/fbgemm/third_party/googletest' 2024-08-20T22:12:30.5011759Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-08-20T22:12:30.5064610Z Entering 'third_party/flatbuffers' 2024-08-20T22:12:30.5126700Z Entering 'third_party/fmt' 2024-08-20T22:12:30.5178859Z Entering 'third_party/gemmlowp/gemmlowp' 2024-08-20T22:12:30.5230357Z Entering 'third_party/gloo' 2024-08-20T22:12:30.5286992Z Entering 'third_party/googletest' 2024-08-20T22:12:30.5339087Z Entering 'third_party/ideep' 2024-08-20T22:12:30.5389767Z Entering 'third_party/ideep/mkl-dnn' 2024-08-20T22:12:30.5447557Z Entering 'third_party/ittapi' 2024-08-20T22:12:30.5500392Z Entering 'third_party/kineto' 2024-08-20T22:12:30.5556166Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-08-20T22:12:30.5607937Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-08-20T22:12:30.5660110Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-08-20T22:12:30.5712024Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-08-20T22:12:30.5762416Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-08-20T22:12:30.5816229Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-08-20T22:12:30.5872912Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-08-20T22:12:30.5927773Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-08-20T22:12:30.5979178Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-08-20T22:12:30.6030803Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-08-20T22:12:30.6084873Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-08-20T22:12:30.6135682Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-08-20T22:12:30.6189519Z Entering 'third_party/mimalloc' 2024-08-20T22:12:30.6241885Z Entering 'third_party/nccl/nccl' 2024-08-20T22:12:30.6298319Z Entering 'third_party/nlohmann' 2024-08-20T22:12:30.6352014Z Entering 'third_party/onnx' 2024-08-20T22:12:30.6418611Z Entering 'third_party/onnx/third_party/benchmark' 2024-08-20T22:12:30.6472610Z Entering 'third_party/onnx/third_party/pybind11' 2024-08-20T22:12:30.6529555Z Entering 'third_party/opentelemetry-cpp' 2024-08-20T22:12:30.6583698Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-08-20T22:12:30.6635189Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-08-20T22:12:30.6683727Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-08-20T22:12:30.6733475Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-08-20T22:12:30.6784763Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-08-20T22:12:30.6834432Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-08-20T22:12:30.6885286Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-08-20T22:12:30.6933085Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-08-20T22:12:30.6985198Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-08-20T22:12:30.7038719Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-08-20T22:12:30.7116665Z Entering 'third_party/pocketfft' 2024-08-20T22:12:30.7172440Z Entering 'third_party/protobuf' 2024-08-20T22:12:30.7227274Z Entering 'third_party/protobuf/third_party/benchmark' 2024-08-20T22:12:30.7276771Z Entering 'third_party/protobuf/third_party/googletest' 2024-08-20T22:12:30.7330452Z Entering 'third_party/psimd' 2024-08-20T22:12:30.7387440Z Entering 'third_party/pthreadpool' 2024-08-20T22:12:30.7439258Z Entering 'third_party/pybind11' 2024-08-20T22:12:30.7491999Z Entering 'third_party/python-peachpy' 2024-08-20T22:12:30.7543925Z Entering 'third_party/sleef' 2024-08-20T22:12:30.7595221Z Entering 'third_party/tensorpipe' 2024-08-20T22:12:30.7645565Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-08-20T22:12:30.7696413Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-08-20T22:12:30.7746434Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-08-20T22:12:30.7797013Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-08-20T22:12:30.7845228Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-08-20T22:12:30.7922918Z ##[endgroup] 2024-08-20T22:12:30.7925305Z ##[group]Persisting credentials for submodules 2024-08-20T22:12:30.7927832Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2024-08-20T22:12:30.8297826Z Entering 'android/libs/fbjni' 2024-08-20T22:12:30.8366476Z Entering 'third_party/FP16' 2024-08-20T22:12:30.8435946Z Entering 'third_party/FXdiv' 2024-08-20T22:12:30.8503977Z Entering 'third_party/NNPACK' 2024-08-20T22:12:30.8577261Z Entering 'third_party/VulkanMemoryAllocator' 2024-08-20T22:12:30.8643674Z Entering 'third_party/XNNPACK' 2024-08-20T22:12:30.8726768Z Entering 'third_party/benchmark' 2024-08-20T22:12:30.8792924Z Entering 'third_party/cpp-httplib' 2024-08-20T22:12:30.8864618Z Entering 'third_party/cpuinfo' 2024-08-20T22:12:30.8936783Z Entering 'third_party/cudnn_frontend' 2024-08-20T22:12:30.9005180Z Entering 'third_party/cutlass' 2024-08-20T22:12:30.9080347Z Entering 'third_party/eigen' 2024-08-20T22:12:30.9149419Z Entering 'third_party/fbgemm' 2024-08-20T22:12:30.9216456Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-08-20T22:12:30.9284057Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-08-20T22:12:30.9350883Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-08-20T22:12:30.9425017Z Entering 'third_party/fbgemm/third_party/googletest' 2024-08-20T22:12:30.9494473Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-08-20T22:12:30.9563586Z Entering 'third_party/flatbuffers' 2024-08-20T22:12:30.9634161Z Entering 'third_party/fmt' 2024-08-20T22:12:30.9701080Z Entering 'third_party/gemmlowp/gemmlowp' 2024-08-20T22:12:30.9773181Z Entering 'third_party/gloo' 2024-08-20T22:12:30.9845923Z Entering 'third_party/googletest' 2024-08-20T22:12:30.9915036Z Entering 'third_party/ideep' 2024-08-20T22:12:30.9980447Z Entering 'third_party/ideep/mkl-dnn' 2024-08-20T22:12:31.0056312Z Entering 'third_party/ittapi' 2024-08-20T22:12:31.0123412Z Entering 'third_party/kineto' 2024-08-20T22:12:31.0195845Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-08-20T22:12:31.0261432Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-08-20T22:12:31.0329591Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-08-20T22:12:31.0396952Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-08-20T22:12:31.0464123Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-08-20T22:12:31.0528749Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-08-20T22:12:31.0600319Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-08-20T22:12:31.0667185Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-08-20T22:12:31.0735453Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-08-20T22:12:31.0803147Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-08-20T22:12:31.0881439Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-08-20T22:12:31.0947185Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-08-20T22:12:31.1016737Z Entering 'third_party/mimalloc' 2024-08-20T22:12:31.1088125Z Entering 'third_party/nccl/nccl' 2024-08-20T22:12:31.1156156Z Entering 'third_party/nlohmann' 2024-08-20T22:12:31.1225029Z Entering 'third_party/onnx' 2024-08-20T22:12:31.1306781Z Entering 'third_party/onnx/third_party/benchmark' 2024-08-20T22:12:31.1378384Z Entering 'third_party/onnx/third_party/pybind11' 2024-08-20T22:12:31.1450309Z Entering 'third_party/opentelemetry-cpp' 2024-08-20T22:12:31.1521018Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-08-20T22:12:31.1587405Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-08-20T22:12:31.1652217Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-08-20T22:12:31.1717901Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-08-20T22:12:31.1785086Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-08-20T22:12:31.1850650Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-08-20T22:12:31.1916980Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-08-20T22:12:31.1979415Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-08-20T22:12:31.2047198Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-08-20T22:12:31.2119101Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-08-20T22:12:31.2205609Z Entering 'third_party/pocketfft' 2024-08-20T22:12:31.2272307Z Entering 'third_party/protobuf' 2024-08-20T22:12:31.2345551Z Entering 'third_party/protobuf/third_party/benchmark' 2024-08-20T22:12:31.2414750Z Entering 'third_party/protobuf/third_party/googletest' 2024-08-20T22:12:31.2485060Z Entering 'third_party/psimd' 2024-08-20T22:12:31.2552957Z Entering 'third_party/pthreadpool' 2024-08-20T22:12:31.2619039Z Entering 'third_party/pybind11' 2024-08-20T22:12:31.2686790Z Entering 'third_party/python-peachpy' 2024-08-20T22:12:31.2753853Z Entering 'third_party/sleef' 2024-08-20T22:12:31.2823715Z Entering 'third_party/tensorpipe' 2024-08-20T22:12:31.2888558Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-08-20T22:12:31.2953559Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-08-20T22:12:31.3019439Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-08-20T22:12:31.3085414Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-08-20T22:12:31.3148875Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-08-20T22:12:31.3239743Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2024-08-20T22:12:31.3599903Z Entering 'android/libs/fbjni' 2024-08-20T22:12:31.3660353Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2024-08-20T22:12:31.3682214Z Entering 'third_party/FP16' 2024-08-20T22:12:31.3745233Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2024-08-20T22:12:31.3764771Z Entering 'third_party/FXdiv' 2024-08-20T22:12:31.3826600Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2024-08-20T22:12:31.3847575Z Entering 'third_party/NNPACK' 2024-08-20T22:12:31.3909048Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2024-08-20T22:12:31.3931220Z Entering 'third_party/VulkanMemoryAllocator' 2024-08-20T22:12:31.3992936Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2024-08-20T22:12:31.4014299Z Entering 'third_party/XNNPACK' 2024-08-20T22:12:31.4076520Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2024-08-20T22:12:31.4113427Z Entering 'third_party/benchmark' 2024-08-20T22:12:31.4174916Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2024-08-20T22:12:31.4196088Z Entering 'third_party/cpp-httplib' 2024-08-20T22:12:31.4257359Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2024-08-20T22:12:31.4278873Z Entering 'third_party/cpuinfo' 2024-08-20T22:12:31.4339606Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2024-08-20T22:12:31.4361128Z Entering 'third_party/cudnn_frontend' 2024-08-20T22:12:31.4422804Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2024-08-20T22:12:31.4443820Z Entering 'third_party/cutlass' 2024-08-20T22:12:31.4506615Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2024-08-20T22:12:31.4536024Z Entering 'third_party/eigen' 2024-08-20T22:12:31.4599543Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/eigen/config remote.origin.url 2024-08-20T22:12:31.4623633Z Entering 'third_party/fbgemm' 2024-08-20T22:12:31.4686924Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2024-08-20T22:12:31.4706997Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-08-20T22:12:31.4767904Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/asmjit/config remote.origin.url 2024-08-20T22:12:31.4788631Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-08-20T22:12:31.4849492Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cpuinfo/config remote.origin.url 2024-08-20T22:12:31.4871176Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-08-20T22:12:31.4937546Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cutlass/config remote.origin.url 2024-08-20T22:12:31.4964855Z Entering 'third_party/fbgemm/third_party/googletest' 2024-08-20T22:12:31.5028117Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/googletest/config remote.origin.url 2024-08-20T22:12:31.5048966Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-08-20T22:12:31.5111749Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/hipify_torch/config remote.origin.url 2024-08-20T22:12:31.5134897Z Entering 'third_party/flatbuffers' 2024-08-20T22:12:31.5197644Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2024-08-20T22:12:31.5221435Z Entering 'third_party/fmt' 2024-08-20T22:12:31.5285055Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2024-08-20T22:12:31.5307162Z Entering 'third_party/gemmlowp/gemmlowp' 2024-08-20T22:12:31.5370072Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2024-08-20T22:12:31.5391589Z Entering 'third_party/gloo' 2024-08-20T22:12:31.5452684Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2024-08-20T22:12:31.5474042Z Entering 'third_party/googletest' 2024-08-20T22:12:31.5535620Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2024-08-20T22:12:31.5556808Z Entering 'third_party/ideep' 2024-08-20T22:12:31.5619143Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2024-08-20T22:12:31.5637927Z Entering 'third_party/ideep/mkl-dnn' 2024-08-20T22:12:31.5704110Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2024-08-20T22:12:31.5733319Z Entering 'third_party/ittapi' 2024-08-20T22:12:31.5794670Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2024-08-20T22:12:31.5815688Z Entering 'third_party/kineto' 2024-08-20T22:12:31.5876965Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2024-08-20T22:12:31.5896882Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-08-20T22:12:31.5958680Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2024-08-20T22:12:31.5977792Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-08-20T22:12:31.6039509Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2024-08-20T22:12:31.6061739Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-08-20T22:12:31.6126880Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2024-08-20T22:12:31.6147366Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-08-20T22:12:31.6208868Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2024-08-20T22:12:31.6229277Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-08-20T22:12:31.6292043Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2024-08-20T22:12:31.6310007Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-08-20T22:12:31.6373656Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2024-08-20T22:12:31.6396302Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-08-20T22:12:31.6462335Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2024-08-20T22:12:31.6485701Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-08-20T22:12:31.6548429Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2024-08-20T22:12:31.6569221Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-08-20T22:12:31.6629999Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2024-08-20T22:12:31.6650910Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-08-20T22:12:31.6711594Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2024-08-20T22:12:31.6737312Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-08-20T22:12:31.6798298Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2024-08-20T22:12:31.6819667Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-08-20T22:12:31.6881416Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2024-08-20T22:12:31.6904244Z Entering 'third_party/mimalloc' 2024-08-20T22:12:31.6968171Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2024-08-20T22:12:31.6988819Z Entering 'third_party/nccl/nccl' 2024-08-20T22:12:31.7050957Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nccl/nccl/config remote.origin.url 2024-08-20T22:12:31.7073457Z Entering 'third_party/nlohmann' 2024-08-20T22:12:31.7135220Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2024-08-20T22:12:31.7158255Z Entering 'third_party/onnx' 2024-08-20T22:12:31.7221644Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2024-08-20T22:12:31.7256129Z Entering 'third_party/onnx/third_party/benchmark' 2024-08-20T22:12:31.7316310Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/benchmark/config remote.origin.url 2024-08-20T22:12:31.7338140Z Entering 'third_party/onnx/third_party/pybind11' 2024-08-20T22:12:31.7401000Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2024-08-20T22:12:31.7426860Z Entering 'third_party/opentelemetry-cpp' 2024-08-20T22:12:31.7490059Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2024-08-20T22:12:31.7510935Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-08-20T22:12:31.7572753Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2024-08-20T22:12:31.7592162Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-08-20T22:12:31.7652422Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2024-08-20T22:12:31.7673118Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-08-20T22:12:31.7738389Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2024-08-20T22:12:31.7757907Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-08-20T22:12:31.7818864Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2024-08-20T22:12:31.7839921Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-08-20T22:12:31.7900211Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2024-08-20T22:12:31.7920536Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-08-20T22:12:31.7981744Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2024-08-20T22:12:31.8001335Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-08-20T22:12:31.8061294Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2024-08-20T22:12:31.8081464Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-08-20T22:12:31.8145637Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2024-08-20T22:12:31.8166728Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-08-20T22:12:31.8229205Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2024-08-20T22:12:31.8252928Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-08-20T22:12:31.8313508Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2024-08-20T22:12:31.8354141Z Entering 'third_party/pocketfft' 2024-08-20T22:12:31.8418647Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2024-08-20T22:12:31.8439864Z Entering 'third_party/protobuf' 2024-08-20T22:12:31.8502302Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2024-08-20T22:12:31.8525577Z Entering 'third_party/protobuf/third_party/benchmark' 2024-08-20T22:12:31.8585911Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2024-08-20T22:12:31.8605748Z Entering 'third_party/protobuf/third_party/googletest' 2024-08-20T22:12:31.8666819Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2024-08-20T22:12:31.8692044Z Entering 'third_party/psimd' 2024-08-20T22:12:31.8756029Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2024-08-20T22:12:31.8776992Z Entering 'third_party/pthreadpool' 2024-08-20T22:12:31.8838856Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2024-08-20T22:12:31.8860411Z Entering 'third_party/pybind11' 2024-08-20T22:12:31.8922877Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2024-08-20T22:12:31.8944499Z Entering 'third_party/python-peachpy' 2024-08-20T22:12:31.9007229Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2024-08-20T22:12:31.9028759Z Entering 'third_party/sleef' 2024-08-20T22:12:31.9091219Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2024-08-20T22:12:31.9112789Z Entering 'third_party/tensorpipe' 2024-08-20T22:12:31.9175983Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2024-08-20T22:12:31.9196078Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-08-20T22:12:31.9255931Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2024-08-20T22:12:31.9275534Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-08-20T22:12:31.9334774Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2024-08-20T22:12:31.9355051Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-08-20T22:12:31.9417052Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2024-08-20T22:12:31.9436622Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-08-20T22:12:31.9498762Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2024-08-20T22:12:31.9516522Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-08-20T22:12:31.9579027Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2024-08-20T22:12:32.0449338Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2024-08-20T22:12:32.0816763Z Entering 'android/libs/fbjni' 2024-08-20T22:12:32.0867409Z Entering 'third_party/FP16' 2024-08-20T22:12:32.0919387Z Entering 'third_party/FXdiv' 2024-08-20T22:12:32.0970627Z Entering 'third_party/NNPACK' 2024-08-20T22:12:32.1021814Z Entering 'third_party/VulkanMemoryAllocator' 2024-08-20T22:12:32.1073408Z Entering 'third_party/XNNPACK' 2024-08-20T22:12:32.1139475Z Entering 'third_party/benchmark' 2024-08-20T22:12:32.1191555Z Entering 'third_party/cpp-httplib' 2024-08-20T22:12:32.1246162Z Entering 'third_party/cpuinfo' 2024-08-20T22:12:32.1299699Z Entering 'third_party/cudnn_frontend' 2024-08-20T22:12:32.1350649Z Entering 'third_party/cutlass' 2024-08-20T22:12:32.1410151Z Entering 'third_party/eigen' 2024-08-20T22:12:32.1463675Z Entering 'third_party/fbgemm' 2024-08-20T22:12:32.1513246Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-08-20T22:12:32.1568001Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-08-20T22:12:32.1620710Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-08-20T22:12:32.1677663Z Entering 'third_party/fbgemm/third_party/googletest' 2024-08-20T22:12:32.1727383Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-08-20T22:12:32.1780980Z Entering 'third_party/flatbuffers' 2024-08-20T22:12:32.1835860Z Entering 'third_party/fmt' 2024-08-20T22:12:32.1889271Z Entering 'third_party/gemmlowp/gemmlowp' 2024-08-20T22:12:32.1942012Z Entering 'third_party/gloo' 2024-08-20T22:12:32.1995207Z Entering 'third_party/googletest' 2024-08-20T22:12:32.2046252Z Entering 'third_party/ideep' 2024-08-20T22:12:32.2096834Z Entering 'third_party/ideep/mkl-dnn' 2024-08-20T22:12:32.2157725Z Entering 'third_party/ittapi' 2024-08-20T22:12:32.2208238Z Entering 'third_party/kineto' 2024-08-20T22:12:32.2257876Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-08-20T22:12:32.2307857Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-08-20T22:12:32.2358985Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-08-20T22:12:32.2409473Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-08-20T22:12:32.2458826Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-08-20T22:12:32.2507421Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-08-20T22:12:32.2561613Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-08-20T22:12:32.2612208Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-08-20T22:12:32.2661553Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-08-20T22:12:32.2712474Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-08-20T22:12:32.2765023Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-08-20T22:12:32.2815797Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-08-20T22:12:32.2870022Z Entering 'third_party/mimalloc' 2024-08-20T22:12:32.2922925Z Entering 'third_party/nccl/nccl' 2024-08-20T22:12:32.2975231Z Entering 'third_party/nlohmann' 2024-08-20T22:12:32.3028522Z Entering 'third_party/onnx' 2024-08-20T22:12:32.3093350Z Entering 'third_party/onnx/third_party/benchmark' 2024-08-20T22:12:32.3147719Z Entering 'third_party/onnx/third_party/pybind11' 2024-08-20T22:12:32.3209223Z Entering 'third_party/opentelemetry-cpp' 2024-08-20T22:12:32.3259696Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-08-20T22:12:32.3309314Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-08-20T22:12:32.3359195Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-08-20T22:12:32.3409789Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-08-20T22:12:32.3460847Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-08-20T22:12:32.3510198Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-08-20T22:12:32.3564531Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-08-20T22:12:32.3613933Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-08-20T22:12:32.3667895Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-08-20T22:12:32.3721499Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-08-20T22:12:32.3801383Z Entering 'third_party/pocketfft' 2024-08-20T22:12:32.3856004Z Entering 'third_party/protobuf' 2024-08-20T22:12:32.3909178Z Entering 'third_party/protobuf/third_party/benchmark' 2024-08-20T22:12:32.3964299Z Entering 'third_party/protobuf/third_party/googletest' 2024-08-20T22:12:32.4018915Z Entering 'third_party/psimd' 2024-08-20T22:12:32.4072178Z Entering 'third_party/pthreadpool' 2024-08-20T22:12:32.4122815Z Entering 'third_party/pybind11' 2024-08-20T22:12:32.4173667Z Entering 'third_party/python-peachpy' 2024-08-20T22:12:32.4223884Z Entering 'third_party/sleef' 2024-08-20T22:12:32.4274910Z Entering 'third_party/tensorpipe' 2024-08-20T22:12:32.4325615Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-08-20T22:12:32.4380179Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-08-20T22:12:32.4429543Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-08-20T22:12:32.4485157Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-08-20T22:12:32.4532846Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-08-20T22:12:32.4607289Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2024-08-20T22:12:32.4970611Z Entering 'android/libs/fbjni' 2024-08-20T22:12:32.5022207Z Entering 'third_party/FP16' 2024-08-20T22:12:32.5075168Z Entering 'third_party/FXdiv' 2024-08-20T22:12:32.5126931Z Entering 'third_party/NNPACK' 2024-08-20T22:12:32.5179629Z Entering 'third_party/VulkanMemoryAllocator' 2024-08-20T22:12:32.5231982Z Entering 'third_party/XNNPACK' 2024-08-20T22:12:32.5299535Z Entering 'third_party/benchmark' 2024-08-20T22:12:32.5352373Z Entering 'third_party/cpp-httplib' 2024-08-20T22:12:32.5407208Z Entering 'third_party/cpuinfo' 2024-08-20T22:12:32.5459384Z Entering 'third_party/cudnn_frontend' 2024-08-20T22:12:32.5513468Z Entering 'third_party/cutlass' 2024-08-20T22:12:32.5573469Z Entering 'third_party/eigen' 2024-08-20T22:12:32.5628205Z Entering 'third_party/fbgemm' 2024-08-20T22:12:32.5684327Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-08-20T22:12:32.5734502Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-08-20T22:12:32.5785435Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-08-20T22:12:32.5842359Z Entering 'third_party/fbgemm/third_party/googletest' 2024-08-20T22:12:32.5892419Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-08-20T22:12:32.5945191Z Entering 'third_party/flatbuffers' 2024-08-20T22:12:32.6000318Z Entering 'third_party/fmt' 2024-08-20T22:12:32.6052411Z Entering 'third_party/gemmlowp/gemmlowp' 2024-08-20T22:12:32.6107844Z Entering 'third_party/gloo' 2024-08-20T22:12:32.6159088Z Entering 'third_party/googletest' 2024-08-20T22:12:32.6210877Z Entering 'third_party/ideep' 2024-08-20T22:12:32.6288143Z Entering 'third_party/ideep/mkl-dnn' 2024-08-20T22:12:32.6346062Z Entering 'third_party/ittapi' 2024-08-20T22:12:32.6397859Z Entering 'third_party/kineto' 2024-08-20T22:12:32.6448960Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-08-20T22:12:32.6498960Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-08-20T22:12:32.6551586Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-08-20T22:12:32.6602940Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-08-20T22:12:32.6654351Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-08-20T22:12:32.6707575Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-08-20T22:12:32.6761970Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-08-20T22:12:32.6812576Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-08-20T22:12:32.6862346Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-08-20T22:12:32.6913368Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-08-20T22:12:32.6967482Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-08-20T22:12:32.7018214Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-08-20T22:12:32.7073961Z Entering 'third_party/mimalloc' 2024-08-20T22:12:32.7126251Z Entering 'third_party/nccl/nccl' 2024-08-20T22:12:32.7177972Z Entering 'third_party/nlohmann' 2024-08-20T22:12:32.7230602Z Entering 'third_party/onnx' 2024-08-20T22:12:32.7296288Z Entering 'third_party/onnx/third_party/benchmark' 2024-08-20T22:12:32.7347417Z Entering 'third_party/onnx/third_party/pybind11' 2024-08-20T22:12:32.7404086Z Entering 'third_party/opentelemetry-cpp' 2024-08-20T22:12:32.7456859Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-08-20T22:12:32.7507348Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-08-20T22:12:32.7557630Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-08-20T22:12:32.7608989Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-08-20T22:12:32.7660131Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-08-20T22:12:32.7710448Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-08-20T22:12:32.7760121Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-08-20T22:12:32.7808259Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-08-20T22:12:32.7861021Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-08-20T22:12:32.7914765Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-08-20T22:12:32.7985984Z Entering 'third_party/pocketfft' 2024-08-20T22:12:32.8038953Z Entering 'third_party/protobuf' 2024-08-20T22:12:32.8092861Z Entering 'third_party/protobuf/third_party/benchmark' 2024-08-20T22:12:32.8143382Z Entering 'third_party/protobuf/third_party/googletest' 2024-08-20T22:12:32.8200732Z Entering 'third_party/psimd' 2024-08-20T22:12:32.8252443Z Entering 'third_party/pthreadpool' 2024-08-20T22:12:32.8306501Z Entering 'third_party/pybind11' 2024-08-20T22:12:32.8358084Z Entering 'third_party/python-peachpy' 2024-08-20T22:12:32.8408595Z Entering 'third_party/sleef' 2024-08-20T22:12:32.8459740Z Entering 'third_party/tensorpipe' 2024-08-20T22:12:32.8509917Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-08-20T22:12:32.8559440Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-08-20T22:12:32.8617094Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-08-20T22:12:32.8667201Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-08-20T22:12:32.8718358Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-08-20T22:12:32.8790137Z ##[endgroup] 2024-08-20T22:12:32.8837744Z [command]/usr/bin/git log -1 --format='%H' 2024-08-20T22:12:32.8873977Z '40ec5f6ddd9787aca0449b24128343ff4c4a88b3' 2024-08-20T22:12:32.9089886Z Prepare all required actions 2024-08-20T22:12:32.9090341Z Getting action download info 2024-08-20T22:12:33.0549194Z ##[group]Run ./.github/actions/setup-linux 2024-08-20T22:12:33.0549609Z env: 2024-08-20T22:12:33.0549887Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:33.0550224Z ##[endgroup] 2024-08-20T22:12:33.0609725Z ##[group]Run set -euo pipefail 2024-08-20T22:12:33.0610167Z set -euo pipefail 2024-08-20T22:12:33.0610739Z function get_ec2_metadata() { 2024-08-20T22:12:33.0611296Z  # Pulled from instance metadata endpoint for EC2 2024-08-20T22:12:33.0612172Z  # see https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/instancedata-data-retrieval.html 2024-08-20T22:12:33.0612925Z  category=$1 2024-08-20T22:12:33.0613458Z  # If it is GCP runner (runner name contains gcp), do not run this 2024-08-20T22:12:33.0614097Z  runner_name_str=i-0b43e2cc0d7540218 2024-08-20T22:12:33.0614591Z  if [[ -f /.inarc ]]; then 2024-08-20T22:12:33.0615076Z  echo "ARC Runner, no info on ec2 metadata" 2024-08-20T22:12:33.0615638Z  elif [[ $runner_name_str == *"gcp"* ]]; then 2024-08-20T22:12:33.0616314Z  echo "Runner is from Google Cloud Platform, No info on ec2 metadata" 2024-08-20T22:12:33.0616910Z  else 2024-08-20T22:12:33.0617392Z  curl -fsSL "http://169.254.169.254/latest/meta-data/${category}" 2024-08-20T22:12:33.0617966Z  fi 2024-08-20T22:12:33.0618253Z } 2024-08-20T22:12:33.0618608Z echo "ami-id: $(get_ec2_metadata ami-id)" 2024-08-20T22:12:33.0619186Z echo "instance-id: $(get_ec2_metadata instance-id)" 2024-08-20T22:12:33.0619826Z echo "instance-type: $(get_ec2_metadata instance-type)" 2024-08-20T22:12:33.0620380Z echo "system info $(uname -a)" 2024-08-20T22:12:33.0630625Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:33.0631123Z env: 2024-08-20T22:12:33.0631408Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:33.0631758Z ##[endgroup] 2024-08-20T22:12:33.0751649Z ami-id: ami-06c68f701d8090592 2024-08-20T22:12:33.0810885Z instance-id: i-0b43e2cc0d7540218 2024-08-20T22:12:33.0867380Z instance-type: g5.4xlarge 2024-08-20T22:12:33.0882127Z system info Linux ip-10-0-70-243.ec2.internal 6.1.94-99.176.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Jun 18 14:57:56 UTC 2024 x86_64 x86_64 x86_64 GNU/Linux 2024-08-20T22:12:33.0905951Z ##[group]Run echo "IN_ARC_RUNNER=$([ -f /.inarc ] && echo true || echo false)" >> $GITHUB_OUTPUT 2024-08-20T22:12:33.0906869Z echo "IN_ARC_RUNNER=$([ -f /.inarc ] && echo true || echo false)" >> $GITHUB_OUTPUT 2024-08-20T22:12:33.0916663Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:33.0917162Z env: 2024-08-20T22:12:33.0917441Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:33.0917791Z ##[endgroup] 2024-08-20T22:12:33.0993615Z ##[group]Run if systemctl is-active --quiet docker; then 2024-08-20T22:12:33.0994213Z if systemctl is-active --quiet docker; then 2024-08-20T22:12:33.0994746Z  echo "Docker daemon is running..."; 2024-08-20T22:12:33.0995191Z else 2024-08-20T22:12:33.0995672Z  echo "Starting docker deamon..." && sudo systemctl start docker; 2024-08-20T22:12:33.0996237Z fi 2024-08-20T22:12:33.1004517Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:33.1005018Z env: 2024-08-20T22:12:33.1005290Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:33.1005626Z ##[endgroup] 2024-08-20T22:12:33.1094142Z Docker daemon is running... 2024-08-20T22:12:33.1143526Z ##[group]Run nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482 2024-08-20T22:12:33.1144084Z with: 2024-08-20T22:12:33.1144358Z shell: bash 2024-08-20T22:12:33.1144652Z timeout_minutes: 5 2024-08-20T22:12:33.1144979Z max_attempts: 3 2024-08-20T22:12:33.1145298Z retry_wait_seconds: 30 2024-08-20T22:12:33.1148347Z command: AWS_ACCOUNT_ID=$(aws sts get-caller-identity|grep Account|cut -f4 -d\") aws ecr get-login-password --region "$AWS_DEFAULT_REGION" | docker login --username AWS \ --password-stdin "$AWS_ACCOUNT_ID.dkr.ecr.$AWS_DEFAULT_REGION.amazonaws.com" # For LF Runners we need to make sure we also login to Meta's ECR docker registry too. META_AWS_ACCOUNT_ID=308535385114 if [ "$AWS_ACCOUNT_ID" != "$META_AWS_ACCOUNT_ID" ] ; then aws ecr get-login-password --region "$AWS_DEFAULT_REGION" | docker login --username AWS \ --password-stdin "$META_AWS_ACCOUNT_ID.dkr.ecr.$AWS_DEFAULT_REGION.amazonaws.com" fi 2024-08-20T22:12:33.1151397Z polling_interval_seconds: 1 2024-08-20T22:12:33.1151770Z warning_on_retry: true 2024-08-20T22:12:33.1152157Z continue_on_error: false 2024-08-20T22:12:33.1152503Z env: 2024-08-20T22:12:33.1152781Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:33.1153128Z AWS_RETRY_MODE: standard 2024-08-20T22:12:33.1153477Z AWS_MAX_ATTEMPTS: 5 2024-08-20T22:12:33.1153821Z AWS_DEFAULT_REGION: us-east-1 2024-08-20T22:12:33.1154195Z ##[endgroup] 2024-08-20T22:12:34.2630130Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2024-08-20T22:12:34.2630944Z Configure a credential helper to remove this warning. See 2024-08-20T22:12:34.2633663Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2024-08-20T22:12:34.2634366Z 2024-08-20T22:12:34.2634529Z Login Succeeded 2024-08-20T22:12:35.1717141Z Command completed after 1 attempt(s). 2024-08-20T22:12:35.1784154Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2024-08-20T22:12:35.1784849Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2024-08-20T22:12:35.1785484Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2024-08-20T22:12:35.1795365Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:35.1795849Z env: 2024-08-20T22:12:35.1796128Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:35.1796469Z ##[endgroup] 2024-08-20T22:12:35.1890371Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2024-08-20T22:12:35.1891111Z # ignore expansion of "docker ps -q" since it could be empty 2024-08-20T22:12:35.1891689Z # shellcheck disable=SC2046 2024-08-20T22:12:35.1892147Z docker stop $(docker ps -q) || true 2024-08-20T22:12:35.1892656Z # Prune all of the docker images 2024-08-20T22:12:35.1893115Z docker system prune -af 2024-08-20T22:12:35.1901321Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:35.1901809Z env: 2024-08-20T22:12:35.1902116Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:35.1902481Z ##[endgroup] 2024-08-20T22:12:35.2244780Z "docker stop" requires at least 1 argument. 2024-08-20T22:12:35.2245512Z See 'docker stop --help'. 2024-08-20T22:12:35.2245754Z 2024-08-20T22:12:35.2245971Z Usage: docker stop [OPTIONS] CONTAINER [CONTAINER...] 2024-08-20T22:12:35.2246329Z 2024-08-20T22:12:35.2246482Z Stop one or more running containers 2024-08-20T22:12:35.2453914Z Total reclaimed space: 0B 2024-08-20T22:12:35.2501875Z ##[group]Run set +e 2024-08-20T22:12:35.2502287Z set +e 2024-08-20T22:12:35.2502583Z set -x 2024-08-20T22:12:35.2502878Z  2024-08-20T22:12:35.2503192Z PT_DOMAIN=download.pytorch.org 2024-08-20T22:12:35.2503945Z # TODO: Flaky access to download.pytorch.org https://github.com/pytorch/pytorch/issues/100400, 2024-08-20T22:12:35.2504993Z # cleaning this up once the issue is fixed. There are more than one resolved IP here, the last 2024-08-20T22:12:35.2505715Z # one is returned at random 2024-08-20T22:12:35.2506242Z RESOLVED_IP=$(dig -4 +short "${PT_DOMAIN}" | tail -n1) 2024-08-20T22:12:35.2506745Z  2024-08-20T22:12:35.2507059Z if [ -z "${RESOLVED_IP}" ]; then 2024-08-20T22:12:35.2507658Z  echo "Couldn't resolve ${PT_DOMAIN}, retrying with Google DNS..." 2024-08-20T22:12:35.2508596Z  RESOLVED_IP=$(dig -4 +short "${PT_DOMAIN}" @8.8.8.8 | tail -n1) 2024-08-20T22:12:35.2509148Z  2024-08-20T22:12:35.2509473Z  if [ -z "${RESOLVED_IP}" ]; then 2024-08-20T22:12:35.2510009Z  echo "Couldn't resolve ${PT_DOMAIN}, exiting..." 2024-08-20T22:12:35.2510518Z  exit 1 2024-08-20T22:12:35.2510834Z  fi 2024-08-20T22:12:35.2511115Z fi 2024-08-20T22:12:35.2511396Z  2024-08-20T22:12:35.2511891Z if grep -r "${PT_DOMAIN}" /etc/hosts; then 2024-08-20T22:12:35.2512396Z  # Clean up any old records first 2024-08-20T22:12:35.2512897Z  sudo sed -i "/${PT_DOMAIN}/d" /etc/hosts 2024-08-20T22:12:35.2513344Z fi 2024-08-20T22:12:35.2513614Z  2024-08-20T22:12:35.2514040Z echo "${RESOLVED_IP} ${PT_DOMAIN}" | sudo tee -a /etc/hosts 2024-08-20T22:12:35.2514576Z cat /etc/hosts 2024-08-20T22:12:35.2523686Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:35.2524180Z env: 2024-08-20T22:12:35.2524468Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:35.2524802Z ##[endgroup] 2024-08-20T22:12:35.2553659Z + PT_DOMAIN=download.pytorch.org 2024-08-20T22:12:35.2560074Z ++ dig -4 +short download.pytorch.org 2024-08-20T22:12:35.2560544Z ++ tail -n1 2024-08-20T22:12:35.3006831Z + RESOLVED_IP=18.160.10.22 2024-08-20T22:12:35.3007373Z + '[' -z 18.160.10.22 ']' 2024-08-20T22:12:35.3007806Z + grep -r download.pytorch.org /etc/hosts 2024-08-20T22:12:35.3025622Z + echo '18.160.10.22 download.pytorch.org' 2024-08-20T22:12:35.3026376Z + sudo tee -a /etc/hosts 2024-08-20T22:12:35.5244157Z 18.160.10.22 download.pytorch.org 2024-08-20T22:12:35.5266679Z + cat /etc/hosts 2024-08-20T22:12:35.5277637Z 127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4 2024-08-20T22:12:35.5289576Z ::1 localhost6 localhost6.localdomain6 2024-08-20T22:12:35.5290306Z 18.160.10.22 download.pytorch.org 2024-08-20T22:12:35.5441648Z ##[group]Run pytorch/test-infra/.github/actions/calculate-docker-image@main 2024-08-20T22:12:35.5442264Z with: 2024-08-20T22:12:35.5443227Z docker-image-name: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:35.5444289Z docker-build-dir: .ci/docker 2024-08-20T22:12:35.5444670Z working-directory: . 2024-08-20T22:12:35.5445127Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:35.5445663Z force-push: false 2024-08-20T22:12:35.5445968Z env: 2024-08-20T22:12:35.5446239Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:35.5446580Z ##[endgroup] 2024-08-20T22:12:35.5465091Z ##[group]Run set -ex 2024-08-20T22:12:35.5465448Z set -ex 2024-08-20T22:12:35.5465751Z  2024-08-20T22:12:35.5466302Z # If the docker build directory or the build script doesn't exist, the action will 2024-08-20T22:12:35.5467293Z # gracefully return the docker image name as it is. Pulling docker image in Linux 2024-08-20T22:12:35.5468374Z # job could then download the pre-built image as usual 2024-08-20T22:12:35.5469097Z if [[ ! -d "${DOCKER_BUILD_DIR}" ]] || [[ ! -f "${DOCKER_BUILD_DIR}/build.sh" ]]; then 2024-08-20T22:12:35.5469758Z  echo "skip=true" >> "${GITHUB_OUTPUT}" 2024-08-20T22:12:35.5470388Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2024-08-20T22:12:35.5470944Z  2024-08-20T22:12:35.5471458Z  echo "There is no Docker build script in ${REPO_NAME} repo, skipping..." 2024-08-20T22:12:35.5472080Z  exit 0 2024-08-20T22:12:35.5472382Z else 2024-08-20T22:12:35.5472758Z  echo "skip=false" >> "${GITHUB_OUTPUT}" 2024-08-20T22:12:35.5473212Z fi 2024-08-20T22:12:35.5473491Z  2024-08-20T22:12:35.5473961Z if [[ "${DOCKER_IMAGE_NAME}" == *"${DOCKER_REGISTRY}/${REPO_NAME}"* ]]; then 2024-08-20T22:12:35.5474798Z  # The docker image name already includes the ECR prefix and tag, so we can just 2024-08-20T22:12:35.5475556Z  # use it as it is, but first let's extract the tag 2024-08-20T22:12:35.5476255Z  DOCKER_TAG=$(echo "${DOCKER_IMAGE_NAME}" | awk -F '[:,]' '{print $2}') 2024-08-20T22:12:35.5476968Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2024-08-20T22:12:35.5477662Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2024-08-20T22:12:35.5478382Z else 2024-08-20T22:12:35.5478823Z  DOCKER_TAG=$(git rev-parse HEAD:"${DOCKER_BUILD_DIR}") 2024-08-20T22:12:35.5479484Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2024-08-20T22:12:35.5480413Z  echo "docker-image=${DOCKER_REGISTRY}/${REPO_NAME}/${DOCKER_IMAGE_NAME}:${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2024-08-20T22:12:35.5481152Z fi 2024-08-20T22:12:35.5492658Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:35.5493151Z env: 2024-08-20T22:12:35.5493441Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:35.5493800Z REPO_NAME: pytorch 2024-08-20T22:12:35.5494770Z DOCKER_IMAGE_NAME: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:35.5495818Z DOCKER_BUILD_DIR: .ci/docker 2024-08-20T22:12:35.5496315Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:35.5496834Z ##[endgroup] 2024-08-20T22:12:35.5529114Z + [[ ! -d .ci/docker ]] 2024-08-20T22:12:35.5529515Z + [[ ! -f .ci/docker/build.sh ]] 2024-08-20T22:12:35.5529884Z + echo skip=false 2024-08-20T22:12:35.5531508Z + [[ 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 == *\3\0\8\5\3\5\3\8\5\1\1\4\.\d\k\r\.\e\c\r\.\u\s\-\e\a\s\t\-\1\.\a\m\a\z\o\n\a\w\s\.\c\o\m\/\p\y\t\o\r\c\h* ]] 2024-08-20T22:12:35.5537577Z ++ echo 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:35.5538667Z ++ awk -F '[:,]' '{print $2}' 2024-08-20T22:12:35.5565004Z + DOCKER_TAG=f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:35.5565919Z + echo docker-tag=f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:35.5567609Z + echo docker-image=308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:35.5596754Z ##[group]Run set +e 2024-08-20T22:12:35.5597112Z set +e 2024-08-20T22:12:35.5597412Z set -x 2024-08-20T22:12:35.5597694Z  2024-08-20T22:12:35.5597972Z login() { 2024-08-20T22:12:35.5598613Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2024-08-20T22:12:35.5599307Z } 2024-08-20T22:12:35.5599660Z  2024-08-20T22:12:35.5599949Z retry () { 2024-08-20T22:12:35.5600328Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2024-08-20T22:12:35.5600756Z } 2024-08-20T22:12:35.5601307Z  2024-08-20T22:12:35.5601681Z retry login "${DOCKER_REGISTRY}" 2024-08-20T22:12:35.5602205Z  2024-08-20T22:12:35.5602911Z # Check if image already exists, if it does then skip building it 2024-08-20T22:12:35.5603722Z if docker manifest inspect "${DOCKER_IMAGE}"; then 2024-08-20T22:12:35.5604378Z  exit 0 2024-08-20T22:12:35.5604824Z fi 2024-08-20T22:12:35.5605155Z  2024-08-20T22:12:35.5605823Z # NB: This part requires a full checkout. Otherwise, the merge base will 2024-08-20T22:12:35.5606768Z # be empty. The default action would be to continue rebuild the image 2024-08-20T22:12:35.5607583Z if [[ "$BASE_REVISION" = "$(git rev-parse HEAD)" ]]; then 2024-08-20T22:12:35.5608328Z  # if we're on the base branch then use the parent commit 2024-08-20T22:12:35.5609047Z  MERGE_BASE=$(git rev-parse HEAD~) 2024-08-20T22:12:35.5609583Z else 2024-08-20T22:12:35.5610102Z  # otherwise we're on a PR, so use the most recent base commit 2024-08-20T22:12:35.5610909Z  MERGE_BASE=$(git merge-base HEAD "$BASE_REVISION") 2024-08-20T22:12:35.5611523Z fi 2024-08-20T22:12:35.5611860Z  2024-08-20T22:12:35.5612363Z if [[ -z "${MERGE_BASE}" ]]; then 2024-08-20T22:12:35.5613211Z  echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2024-08-20T22:12:35.5613719Z  2024-08-20T22:12:35.5614502Z  echo "Finding merge base only works with full checkout, please set fetch-depth to 0, continuing ..." 2024-08-20T22:12:35.5615361Z  exit 0 2024-08-20T22:12:35.5615717Z fi 2024-08-20T22:12:35.5616159Z  2024-08-20T22:12:35.5616681Z if ! git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}"; then 2024-08-20T22:12:35.5617794Z  echo "Directory '${DOCKER_BUILD_DIR}' not found in commit $MERGE_BASE, you should rebase onto a more recent commit" 2024-08-20T22:12:35.5618699Z  exit 1 2024-08-20T22:12:35.5619058Z fi 2024-08-20T22:12:35.5619512Z  2024-08-20T22:12:35.5620078Z PREVIOUS_DOCKER_TAG=$(git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}") 2024-08-20T22:12:35.5621190Z # If no image exists but the hash is the same as the previous hash then we should error out here 2024-08-20T22:12:35.5622118Z if [[ "${PREVIOUS_DOCKER_TAG}" == "${DOCKER_TAG}" ]]; then 2024-08-20T22:12:35.5623253Z  echo "WARNING: Something has gone wrong and the previous image isn't available for the merge-base of your branch" 2024-08-20T22:12:35.5624402Z  echo " Will re-build docker image to store in local cache, TTS may be longer" 2024-08-20T22:12:35.5625150Z fi 2024-08-20T22:12:35.5625474Z  2024-08-20T22:12:35.5626173Z echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2024-08-20T22:12:35.5635478Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:35.5636096Z env: 2024-08-20T22:12:35.5636573Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:35.5637045Z DOCKER_BUILD_DIR: .ci/docker 2024-08-20T22:12:35.5637596Z BASE_REVISION: 40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:12:35.5638803Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:35.5640083Z DOCKER_TAG: f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:35.5640766Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:35.5641397Z ##[endgroup] 2024-08-20T22:12:35.5673213Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:35.5674068Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:35.5676513Z + aws ecr get-login-password --region us-east-1 2024-08-20T22:12:35.5677353Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:36.1007159Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2024-08-20T22:12:36.1008181Z Configure a credential helper to remove this warning. See 2024-08-20T22:12:36.1009438Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2024-08-20T22:12:36.1009979Z 2024-08-20T22:12:36.1010174Z Login Succeeded 2024-08-20T22:12:36.1031697Z + docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:36.3092350Z { 2024-08-20T22:12:36.3093386Z "schemaVersion": 2, 2024-08-20T22:12:36.3094316Z "mediaType": "application/vnd.docker.distribution.manifest.v2+json", 2024-08-20T22:12:36.3095196Z "config": { 2024-08-20T22:12:36.3096205Z "mediaType": "application/vnd.docker.container.image.v1+json", 2024-08-20T22:12:36.3097549Z "size": 49381, 2024-08-20T22:12:36.3098401Z "digest": "sha256:0e10e8f27e667bb8effbc6011be2637bcca6e8a882c60bd12fee3a26ef6dcc2e" 2024-08-20T22:12:36.3099534Z }, 2024-08-20T22:12:36.3100061Z "layers": [ 2024-08-20T22:12:36.3100626Z { 2024-08-20T22:12:36.3101316Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3102140Z "size": 28584317, 2024-08-20T22:12:36.3103154Z "digest": "sha256:63e9bbe323274e77e58d77c6ab6802d247458f784222fbb07a2556d6ec74ee05" 2024-08-20T22:12:36.3104337Z }, 2024-08-20T22:12:36.3105006Z { 2024-08-20T22:12:36.3105757Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3106600Z "size": 7944771, 2024-08-20T22:12:36.3107443Z "digest": "sha256:cfb3d849840ee60cee7b02bad68c1fc3c15928ebcd88f327754766b670578ed6" 2024-08-20T22:12:36.3108329Z }, 2024-08-20T22:12:36.3109408Z { 2024-08-20T22:12:36.3110127Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3111055Z "size": 57593718, 2024-08-20T22:12:36.3111950Z "digest": "sha256:968831e596a6288f0fed9b8612ee4ee8e75511037c4305058805492c5162e481" 2024-08-20T22:12:36.3112902Z }, 2024-08-20T22:12:36.3113937Z { 2024-08-20T22:12:36.3114659Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3115456Z "size": 187, 2024-08-20T22:12:36.3116815Z "digest": "sha256:ea310eb267ca1cab61b6b16f566cd28bfd59a741395a011f5e76716e15ba57c6" 2024-08-20T22:12:36.3117725Z }, 2024-08-20T22:12:36.3118812Z { 2024-08-20T22:12:36.3120047Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3121151Z "size": 6885, 2024-08-20T22:12:36.3122157Z "digest": "sha256:3af11d09e9cd1eb9c379f0a4071231e5a5642eb728b4b33bcb76be291f3c9488" 2024-08-20T22:12:36.3123402Z }, 2024-08-20T22:12:36.3124051Z { 2024-08-20T22:12:36.3125095Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3126566Z "size": 1361380219, 2024-08-20T22:12:36.3127523Z "digest": "sha256:ebfec18059b91e56882881ac34754f917861edb5f732c395d2a1a851bbd6db46" 2024-08-20T22:12:36.3129848Z }, 2024-08-20T22:12:36.3130299Z { 2024-08-20T22:12:36.3131057Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3131974Z "size": 62686, 2024-08-20T22:12:36.3132865Z "digest": "sha256:533b4aebf16914c763b7b0de3ce657590c6f979045e9fdf1f816adaf68d8a4d3" 2024-08-20T22:12:36.3134183Z }, 2024-08-20T22:12:36.3134694Z { 2024-08-20T22:12:36.3135594Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3136609Z "size": 1685, 2024-08-20T22:12:36.3137655Z "digest": "sha256:9dd75d06a0910f28cb1e484b8808724e5a6ee570ecb8fc04631368f546b39ed9" 2024-08-20T22:12:36.3138690Z }, 2024-08-20T22:12:36.3139184Z { 2024-08-20T22:12:36.3140089Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3140977Z "size": 1523, 2024-08-20T22:12:36.3141788Z "digest": "sha256:30bfca4dd3492d60ed8035b0eeb1229897041140db117f5663465a551e25851d" 2024-08-20T22:12:36.3142594Z }, 2024-08-20T22:12:36.3142906Z { 2024-08-20T22:12:36.3143430Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3144158Z "size": 2608186849, 2024-08-20T22:12:36.3144672Z + exit 0 2024-08-20T22:12:36.3145304Z "digest": "sha256:1b57ce94cad9a5097d99b8d6f7bcd51df5a162fc3c5f5686b689b76993724bc8" 2024-08-20T22:12:36.3146067Z }, 2024-08-20T22:12:36.3146374Z { 2024-08-20T22:12:36.3146909Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3147594Z "size": 86631, 2024-08-20T22:12:36.3148195Z "digest": "sha256:9ee6bdb31195dd42fd98147a75d540efe0c5708e0ecda866a0bca060084fddab" 2024-08-20T22:12:36.3148914Z }, 2024-08-20T22:12:36.3149650Z { 2024-08-20T22:12:36.3150139Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3150795Z "size": 1823, 2024-08-20T22:12:36.3151474Z "digest": "sha256:a9f7203f6bc50d4d132288f21cf8e36cef04388ae9240484d490b7e48142843a" 2024-08-20T22:12:36.3152156Z }, 2024-08-20T22:12:36.3152512Z { 2024-08-20T22:12:36.3153071Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3153669Z "size": 245782392, 2024-08-20T22:12:36.3154327Z "digest": "sha256:3dc3f3c4cd525b407f05b785034da8553a7f2ea726a603b0ff77265cb015d410" 2024-08-20T22:12:36.3155078Z }, 2024-08-20T22:12:36.3155384Z { 2024-08-20T22:12:36.3156057Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3156939Z "size": 544, 2024-08-20T22:12:36.3157553Z "digest": "sha256:7bf8ebdbbd747d46300b76e548a001b7235efe26c29dfb42ab6b269d72f92683" 2024-08-20T22:12:36.3158254Z }, 2024-08-20T22:12:36.3158639Z { 2024-08-20T22:12:36.3159164Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3159913Z "size": 1258, 2024-08-20T22:12:36.3160607Z "digest": "sha256:e4173def2b15c621a12e3f25565172f47da48b380be947cede0928fcf5b6ca8c" 2024-08-20T22:12:36.3161266Z }, 2024-08-20T22:12:36.3161608Z { 2024-08-20T22:12:36.3162210Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3162849Z "size": 484, 2024-08-20T22:12:36.3163478Z "digest": "sha256:9d3daead8a92451a8443563b6ad8139a99dfdba1870cc47c33db5ca7f9771eea" 2024-08-20T22:12:36.3164247Z }, 2024-08-20T22:12:36.3164547Z { 2024-08-20T22:12:36.3165067Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3165817Z "size": 91712284, 2024-08-20T22:12:36.3166428Z "digest": "sha256:91f8b300cda0616d9bb53f9d90ebe0bebb9e4103e941d4f262fd524f4a4930a6" 2024-08-20T22:12:36.3167126Z }, 2024-08-20T22:12:36.3167534Z { 2024-08-20T22:12:36.3168366Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3169017Z "size": 3392, 2024-08-20T22:12:36.3169933Z "digest": "sha256:06c119c80bda2e10c11c134e30ac53fb758ded056401e4e85c57cf5de937a72c" 2024-08-20T22:12:36.3170609Z }, 2024-08-20T22:12:36.3170974Z { 2024-08-20T22:12:36.3171543Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3172134Z "size": 1909, 2024-08-20T22:12:36.3172787Z "digest": "sha256:12deefe7ec2331fbd252e4789b972b17bc4e38f59134da879e0ca96be2e57463" 2024-08-20T22:12:36.3173558Z }, 2024-08-20T22:12:36.3173862Z { 2024-08-20T22:12:36.3174397Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3175085Z "size": 701, 2024-08-20T22:12:36.3175673Z "digest": "sha256:e86f6b18e52521ed5d0b66da3b3792b884be964967f366627aec9817c3fe07c1" 2024-08-20T22:12:36.3176385Z }, 2024-08-20T22:12:36.3176764Z { 2024-08-20T22:12:36.3177271Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3177909Z "size": 2880118413, 2024-08-20T22:12:36.3178606Z "digest": "sha256:fae3d4b5c8c3998cda920b00f0803f07eaefc5dde0c4cc4c5600349060397ff7" 2024-08-20T22:12:36.3179294Z }, 2024-08-20T22:12:36.3179632Z { 2024-08-20T22:12:36.3180223Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3180837Z "size": 380, 2024-08-20T22:12:36.3181447Z "digest": "sha256:441a9565ab064f6d7facc749a9fde594243084009a971fa7a5dc20213379ca8a" 2024-08-20T22:12:36.3182177Z }, 2024-08-20T22:12:36.3182505Z { 2024-08-20T22:12:36.3183023Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3183720Z "size": 12875, 2024-08-20T22:12:36.3184320Z "digest": "sha256:74ea64d31b5c80c9c8f852fcb370a96a934f419a0f7cdd1171e1fc8ac8e7111a" 2024-08-20T22:12:36.3185063Z }, 2024-08-20T22:12:36.3185430Z { 2024-08-20T22:12:36.3185945Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3186613Z "size": 804, 2024-08-20T22:12:36.3187245Z "digest": "sha256:781e2451882628557e5b96cc38155f738d0e0e6db6139389491450b7a0455d55" 2024-08-20T22:12:36.3187915Z }, 2024-08-20T22:12:36.3188254Z { 2024-08-20T22:12:36.3188816Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3189450Z "size": 106, 2024-08-20T22:12:36.3190104Z "digest": "sha256:a38ba7c59bd8c03e183ba3c7eb6f8851d32ff5a72e102eef7ecb367cacd10ae5" 2024-08-20T22:12:36.3190820Z }, 2024-08-20T22:12:36.3191157Z { 2024-08-20T22:12:36.3191694Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3192334Z "size": 503, 2024-08-20T22:12:36.3192995Z "digest": "sha256:b62a470dc39d4d3b5713c2701e5c088c88543919479bae0189708525e549a423" 2024-08-20T22:12:36.3193869Z }, 2024-08-20T22:12:36.3194228Z { 2024-08-20T22:12:36.3194778Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3195490Z "size": 121477336, 2024-08-20T22:12:36.3196213Z "digest": "sha256:e94cd6ec737dbeac616815042f077ba6907fb7cb447932e13d2683f5a6bf965b" 2024-08-20T22:12:36.3196994Z }, 2024-08-20T22:12:36.3197336Z { 2024-08-20T22:12:36.3197901Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3198613Z "size": 109, 2024-08-20T22:12:36.3199286Z "digest": "sha256:d678a40efe95ed2cf8316191a80da72d6ed7f1899dec98e98151b55ed05bd3a3" 2024-08-20T22:12:36.3200139Z }, 2024-08-20T22:12:36.3200500Z { 2024-08-20T22:12:36.3201039Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3201674Z "size": 488, 2024-08-20T22:12:36.3202311Z "digest": "sha256:e3ae19135e0f0974a6ff47fc195e5d325f50712714f8e57a4ef1dbd4ab94b44c" 2024-08-20T22:12:36.3203056Z }, 2024-08-20T22:12:36.3203392Z { 2024-08-20T22:12:36.3203921Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3204550Z "size": 297, 2024-08-20T22:12:36.3205189Z "digest": "sha256:8c9bd10f9f83f5aac7df2c36bab80ef124ef39cb142e99ac8af9f90178bcaf86" 2024-08-20T22:12:36.3205905Z }, 2024-08-20T22:12:36.3206236Z { 2024-08-20T22:12:36.3206874Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3207534Z "size": 103, 2024-08-20T22:12:36.3208148Z "digest": "sha256:26d0925d23675bddf086faecad22830252be6642241c5e1d6c3db68dd41ec42c" 2024-08-20T22:12:36.3208867Z }, 2024-08-20T22:12:36.3209195Z { 2024-08-20T22:12:36.3209722Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3210366Z "size": 1473, 2024-08-20T22:12:36.3210985Z "digest": "sha256:f222d352c9fd6c313beb9dae5b68918624aa2e0aff94f93d0604317f7256737b" 2024-08-20T22:12:36.3211713Z }, 2024-08-20T22:12:36.3212062Z { 2024-08-20T22:12:36.3212573Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3213222Z "size": 594731648, 2024-08-20T22:12:36.3213867Z "digest": "sha256:08f43deb0c579afa701893b3e33e574d5708d6a493a08a59e39841220a67370c" 2024-08-20T22:12:36.3214579Z }, 2024-08-20T22:12:36.3214935Z { 2024-08-20T22:12:36.3215463Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3216097Z "size": 163, 2024-08-20T22:12:36.3216749Z "digest": "sha256:b04d9f9eedf512e2f423edecaa6d12447054cc37a4e2fbbeb4c5a88c536fcb08" 2024-08-20T22:12:36.3217469Z }, 2024-08-20T22:12:36.3217805Z { 2024-08-20T22:12:36.3218337Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3218976Z "size": 563, 2024-08-20T22:12:36.3219619Z "digest": "sha256:a6f8cfbf12ea24ed10851ea125d80f2e6a6ff87323f6e07e83f259bde8cd202b" 2024-08-20T22:12:36.3220310Z }, 2024-08-20T22:12:36.3220666Z { 2024-08-20T22:12:36.3221234Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3221856Z "size": 35865650, 2024-08-20T22:12:36.3222544Z "digest": "sha256:816ff351c38c8768f89b7e807fd5a9281f42d3edda531ae9c313c333e5a9e517" 2024-08-20T22:12:36.3223247Z }, 2024-08-20T22:12:36.3223574Z { 2024-08-20T22:12:36.3224098Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3224735Z "size": 104, 2024-08-20T22:12:36.3225347Z "digest": "sha256:829068a22c0dc6f847392e0e14b4f64c12d117e38107c7bbad0a88fee6134ed5" 2024-08-20T22:12:36.3226042Z }, 2024-08-20T22:12:36.3226394Z { 2024-08-20T22:12:36.3226899Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3227533Z "size": 425, 2024-08-20T22:12:36.3228212Z "digest": "sha256:f2f2568ce4a4bcb9d0294f88dec911bace8437b912a37a758de8e9adacf0b472" 2024-08-20T22:12:36.3228910Z }, 2024-08-20T22:12:36.3229277Z { 2024-08-20T22:12:36.3229900Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3230525Z "size": 20262180, 2024-08-20T22:12:36.3231210Z "digest": "sha256:4724466097d45508ac5e6e0fcaebd51be8568863888b3f66ed0728b32b304550" 2024-08-20T22:12:36.3231889Z }, 2024-08-20T22:12:36.3232248Z { 2024-08-20T22:12:36.3232829Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3233463Z "size": 436, 2024-08-20T22:12:36.3234108Z "digest": "sha256:87447c3113eb76640fb1cef6054024fdbc11d2b24af7788036ffbb45c39261b0" 2024-08-20T22:12:36.3234843Z }, 2024-08-20T22:12:36.3235182Z { 2024-08-20T22:12:36.3235720Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3236350Z "size": 701, 2024-08-20T22:12:36.3236955Z "digest": "sha256:e86f6b18e52521ed5d0b66da3b3792b884be964967f366627aec9817c3fe07c1" 2024-08-20T22:12:36.3237657Z }, 2024-08-20T22:12:36.3237997Z { 2024-08-20T22:12:36.3238509Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3239166Z "size": 142, 2024-08-20T22:12:36.3239844Z "digest": "sha256:4c0385306092a03eb28c7316b88c99680313444a6ffccc9d6b247fb9efee2aa2" 2024-08-20T22:12:36.3240529Z }, 2024-08-20T22:12:36.3240883Z { 2024-08-20T22:12:36.3241437Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3242081Z "size": 135, 2024-08-20T22:12:36.3242843Z "digest": "sha256:b2c2ed7cb6f8682545419347a3f26b49a1c15bfe811d7d1c499466d460bc32c2" 2024-08-20T22:12:36.3243555Z }, 2024-08-20T22:12:36.3243923Z { 2024-08-20T22:12:36.3244450Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3245090Z "size": 32, 2024-08-20T22:12:36.3245732Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2024-08-20T22:12:36.3246431Z }, 2024-08-20T22:12:36.3246780Z { 2024-08-20T22:12:36.3247320Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3247987Z "size": 195, 2024-08-20T22:12:36.3248629Z "digest": "sha256:5b02284802e208cecd9363696481375f7ce2a27d9720f1e8ebc7fa7175fceffa" 2024-08-20T22:12:36.3249309Z }, 2024-08-20T22:12:36.3249645Z { 2024-08-20T22:12:36.3250212Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3250847Z "size": 564, 2024-08-20T22:12:36.3251466Z "digest": "sha256:008d94b6385203fa677bed332797a3c60f8d264fac284d46174a9f31e616df5e" 2024-08-20T22:12:36.3252194Z }, 2024-08-20T22:12:36.3252544Z { 2024-08-20T22:12:36.3253058Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3253720Z "size": 43163484, 2024-08-20T22:12:36.3254348Z "digest": "sha256:c89112952ad8ae3d720361500d8a534f31ab01574eab7028257ab2fe6a219f5a" 2024-08-20T22:12:36.3255092Z }, 2024-08-20T22:12:36.3255450Z { 2024-08-20T22:12:36.3255969Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3256619Z "size": 106, 2024-08-20T22:12:36.3257264Z "digest": "sha256:cd13d939f03ff75dd7b2cacf82137836becf96dc951040162c272573ebe2885e" 2024-08-20T22:12:36.3257960Z }, 2024-08-20T22:12:36.3258318Z { 2024-08-20T22:12:36.3258847Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3259493Z "size": 1380, 2024-08-20T22:12:36.3260144Z "digest": "sha256:ec89139a4f20a3e982faf9d4c8de5fa13c0480e848a8da10089bf444e169462b" 2024-08-20T22:12:36.3260850Z }, 2024-08-20T22:12:36.3261187Z { 2024-08-20T22:12:36.3261768Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3262413Z "size": 701, 2024-08-20T22:12:36.3263033Z "digest": "sha256:e86f6b18e52521ed5d0b66da3b3792b884be964967f366627aec9817c3fe07c1" 2024-08-20T22:12:36.3263802Z }, 2024-08-20T22:12:36.3264145Z { 2024-08-20T22:12:36.3264684Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3265315Z "size": 139, 2024-08-20T22:12:36.3265939Z "digest": "sha256:9a7c32b815947d61184ba97586eb4db7aceeecd60713443c6fa0116c52fb9b7a" 2024-08-20T22:12:36.3266743Z }, 2024-08-20T22:12:36.3267086Z { 2024-08-20T22:12:36.3267835Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3268576Z "size": 119, 2024-08-20T22:12:36.3269218Z "digest": "sha256:f8e09cd4393bc7fa40e75c19103ade26a2f3acb39a9d4f549dbeaa97ce7437b5" 2024-08-20T22:12:36.3269974Z }, 2024-08-20T22:12:36.3270311Z { 2024-08-20T22:12:36.3270833Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3271499Z "size": 4327076168, 2024-08-20T22:12:36.3272131Z "digest": "sha256:d37c4b502304d150ea99af74c74856779f89b74477545826a1b7076c766dae73" 2024-08-20T22:12:36.3272816Z }, 2024-08-20T22:12:36.3273197Z { 2024-08-20T22:12:36.3273718Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3274373Z "size": 175, 2024-08-20T22:12:36.3275011Z "digest": "sha256:9c1a57a5c5dd6d27fa23c81e3b891766208a32b5162bbd6be79ae928add2c2e5" 2024-08-20T22:12:36.3275759Z }, 2024-08-20T22:12:36.3276122Z { 2024-08-20T22:12:36.3276667Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3277300Z "size": 907, 2024-08-20T22:12:36.3277926Z "digest": "sha256:5bb16aa8a1a3c0dbc595c8190070095762665f78ebc2906932858898729c50f0" 2024-08-20T22:12:36.3278620Z }, 2024-08-20T22:12:36.3278962Z { 2024-08-20T22:12:36.3279497Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3280394Z "size": 701, 2024-08-20T22:12:36.3281046Z "digest": "sha256:e86f6b18e52521ed5d0b66da3b3792b884be964967f366627aec9817c3fe07c1" 2024-08-20T22:12:36.3281742Z }, 2024-08-20T22:12:36.3282129Z { 2024-08-20T22:12:36.3282718Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3283356Z "size": 134, 2024-08-20T22:12:36.3283983Z "digest": "sha256:573ff564cd644e2dacfc144bcc4c0929677d4574e9bc697f54fb3873719e7463" 2024-08-20T22:12:36.3284709Z }, 2024-08-20T22:12:36.3285044Z { 2024-08-20T22:12:36.3285566Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3286221Z "size": 32, 2024-08-20T22:12:36.3286850Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2024-08-20T22:12:36.3287550Z }, 2024-08-20T22:12:36.3287910Z { 2024-08-20T22:12:36.3288431Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3289117Z "size": 158, 2024-08-20T22:12:36.3289767Z "digest": "sha256:510b8a72136f9c949a62c204c1106bfc18aeb2a4b43c172fb5ab2ed6494fef96" 2024-08-20T22:12:36.3290459Z }, 2024-08-20T22:12:36.3290826Z { 2024-08-20T22:12:36.3291341Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3291973Z "size": 1841, 2024-08-20T22:12:36.3292640Z "digest": "sha256:04cfd53511b800a94eb18766854b2fc75bd06604fa5d485ace99b2de08a6e8dd" 2024-08-20T22:12:36.3293323Z }, 2024-08-20T22:12:36.3293652Z { 2024-08-20T22:12:36.3294196Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3294829Z "size": 7529774, 2024-08-20T22:12:36.3295461Z "digest": "sha256:9e1581e2c0ea0c1bc8d3ad0937afff1b050d1a42fb914b00180b760f677ea28d" 2024-08-20T22:12:36.3296262Z }, 2024-08-20T22:12:36.3296561Z { 2024-08-20T22:12:36.3297096Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3297729Z "size": 164, 2024-08-20T22:12:36.3298355Z "digest": "sha256:8d7e63c166b2be4f3f898a357ba5aee5071643edb969682fa45e2c9d6bd62938" 2024-08-20T22:12:36.3299067Z }, 2024-08-20T22:12:36.3299409Z { 2024-08-20T22:12:36.3299921Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3300569Z "size": 7942, 2024-08-20T22:12:36.3301237Z "digest": "sha256:39801cea2f0a2890bd79daa9b3707b6e57eeec0b04f01e644a3f28d2b99ebc93" 2024-08-20T22:12:36.3301898Z }, 2024-08-20T22:12:36.3302286Z { 2024-08-20T22:12:36.3302804Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3303603Z "size": 8067, 2024-08-20T22:12:36.3304233Z "digest": "sha256:7f9ad806fd9061bd0c0f15f8b5fadb98812afbc51fb5a4f2a84dc091e5ed04eb" 2024-08-20T22:12:36.3304986Z }, 2024-08-20T22:12:36.3305312Z { 2024-08-20T22:12:36.3305833Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3306511Z "size": 301, 2024-08-20T22:12:36.3307119Z "digest": "sha256:3c03760db75fb358ea0d78621768e8c88ea0494282befaeb29bb388349106a36" 2024-08-20T22:12:36.3307808Z }, 2024-08-20T22:12:36.3308199Z { 2024-08-20T22:12:36.3308707Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3309381Z "size": 7630231, 2024-08-20T22:12:36.3310111Z "digest": "sha256:2ba7af7a62d5a901402f9d60bb5756505ffa47dd8ca691ddbe1e6ed4b9db7184" 2024-08-20T22:12:36.3310781Z }, 2024-08-20T22:12:36.3311122Z { 2024-08-20T22:12:36.3311710Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3312322Z "size": 108, 2024-08-20T22:12:36.3312967Z "digest": "sha256:57e34ad7d122c435a45ce249e0b98b57658c1c9327371dde61242f6880b7cc86" 2024-08-20T22:12:36.3313709Z }, 2024-08-20T22:12:36.3314004Z { 2024-08-20T22:12:36.3314515Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3315214Z "size": 54145659, 2024-08-20T22:12:36.3315811Z "digest": "sha256:b90d6140b3b9633f37ae52f27ceacc79ea43ec28e3e90d7956514f95fb07e18a" 2024-08-20T22:12:36.3316668Z }, 2024-08-20T22:12:36.3317069Z { 2024-08-20T22:12:36.3317551Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3318204Z "size": 473, 2024-08-20T22:12:36.3318858Z "digest": "sha256:d5363180761d2c7b2ae38902885d83e86faa7b310bef0f60925d2709a21034fe" 2024-08-20T22:12:36.3319503Z }, 2024-08-20T22:12:36.3319947Z { 2024-08-20T22:12:36.3320521Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3321126Z "size": 1374858411, 2024-08-20T22:12:36.3321778Z "digest": "sha256:f79f8c85c5737659b4f63c15376488e2435f6619d5518f39ac88d7e25b2fe85d" 2024-08-20T22:12:36.3322553Z }, 2024-08-20T22:12:36.3322891Z { 2024-08-20T22:12:36.3323440Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3324115Z "size": 106, 2024-08-20T22:12:36.3324701Z "digest": "sha256:7eeb12acd9735a89c45dde9e88c5b13229af46d99a52a1706dd407bbb678b57a" 2024-08-20T22:12:36.3325417Z }, 2024-08-20T22:12:36.3325801Z { 2024-08-20T22:12:36.3326300Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3326932Z "size": 559, 2024-08-20T22:12:36.3327587Z "digest": "sha256:d960221b0fe709481836e604198d698a6a5e5b336e6671a77dde3daa15ccd820" 2024-08-20T22:12:36.3328257Z }, 2024-08-20T22:12:36.3328603Z { 2024-08-20T22:12:36.3329163Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3329826Z "size": 46248339, 2024-08-20T22:12:36.3330448Z "digest": "sha256:18b851371523787123983dbf8343ebbe899631adaae72840f032439f709a9b94" 2024-08-20T22:12:36.3331170Z }, 2024-08-20T22:12:36.3331492Z { 2024-08-20T22:12:36.3332003Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3332732Z "size": 111, 2024-08-20T22:12:36.3333337Z "digest": "sha256:be2e37c8b6929209a06d8f4985b38972fcf4d05aa2d76683b3d8f8ca9072c317" 2024-08-20T22:12:36.3334027Z }, 2024-08-20T22:12:36.3334488Z { 2024-08-20T22:12:36.3334972Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3335626Z "size": 32, 2024-08-20T22:12:36.3336305Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2024-08-20T22:12:36.3336971Z }, 2024-08-20T22:12:36.3337480Z { 2024-08-20T22:12:36.3338130Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3338970Z "size": 32, 2024-08-20T22:12:36.3339672Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2024-08-20T22:12:36.3340481Z }, 2024-08-20T22:12:36.3340783Z { 2024-08-20T22:12:36.3341387Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3342034Z "size": 32, 2024-08-20T22:12:36.3342676Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2024-08-20T22:12:36.3343397Z }, 2024-08-20T22:12:36.3343784Z { 2024-08-20T22:12:36.3344288Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-08-20T22:12:36.3344952Z "size": 32, 2024-08-20T22:12:36.3345545Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2024-08-20T22:12:36.3346277Z } 2024-08-20T22:12:36.3346653Z ] 2024-08-20T22:12:36.3347008Z } 2024-08-20T22:12:36.3456972Z ##[group]Run tag=${ECR_DOCKER_IMAGE##*/} 2024-08-20T22:12:36.3457546Z tag=${ECR_DOCKER_IMAGE##*/} 2024-08-20T22:12:36.3458195Z echo "docker pull ghcr.io/pytorch/ci-image:${tag/:/-}" 2024-08-20T22:12:36.3658578Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:36.3659325Z env: 2024-08-20T22:12:36.3659708Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:36.3661244Z ECR_DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:36.3662749Z ##[endgroup] 2024-08-20T22:12:36.3703673Z docker pull ghcr.io/pytorch/ci-image:pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9-f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:36.3758116Z ##[group]Run pytorch/test-infra/.github/actions/pull-docker-image@main 2024-08-20T22:12:36.3758692Z with: 2024-08-20T22:12:36.3759714Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:36.3760903Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:36.3761414Z env: 2024-08-20T22:12:36.3761716Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:36.3762075Z ##[endgroup] 2024-08-20T22:12:36.3781289Z ##[group]Run set -x 2024-08-20T22:12:36.3781636Z set -x 2024-08-20T22:12:36.3781942Z set +e 2024-08-20T22:12:36.3782245Z  2024-08-20T22:12:36.3782522Z login() { 2024-08-20T22:12:36.3783177Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2024-08-20T22:12:36.3783883Z } 2024-08-20T22:12:36.3784159Z  2024-08-20T22:12:36.3784469Z retry () { 2024-08-20T22:12:36.3784851Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2024-08-20T22:12:36.3785299Z } 2024-08-20T22:12:36.3785571Z  2024-08-20T22:12:36.3785876Z retry login "${DOCKER_REGISTRY}" 2024-08-20T22:12:36.3786287Z  2024-08-20T22:12:36.3786563Z set -e 2024-08-20T22:12:36.3787026Z # ignore output since only exit code is used for conditional 2024-08-20T22:12:36.3787719Z # only pull docker image if it's not available locally 2024-08-20T22:12:36.3788466Z if ! docker inspect --type=image "${DOCKER_IMAGE}" >/dev/null 2>/dev/null; then 2024-08-20T22:12:36.3789141Z  retry docker pull "${DOCKER_IMAGE}" 2024-08-20T22:12:36.3789570Z fi 2024-08-20T22:12:36.3798485Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:12:36.3798977Z env: 2024-08-20T22:12:36.3799255Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:12:36.3800331Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:36.3801477Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:36.3801984Z ##[endgroup] 2024-08-20T22:12:36.3831084Z + set +e 2024-08-20T22:12:36.3831626Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:36.3832269Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:36.3834933Z + aws ecr get-login-password --region us-east-1 2024-08-20T22:12:36.3836284Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-08-20T22:12:36.9484251Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2024-08-20T22:12:36.9485303Z Configure a credential helper to remove this warning. See 2024-08-20T22:12:36.9486326Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2024-08-20T22:12:36.9486913Z 2024-08-20T22:12:36.9487056Z Login Succeeded 2024-08-20T22:12:36.9507130Z + set -e 2024-08-20T22:12:36.9508307Z + docker inspect --type=image 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:36.9670734Z + retry docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:36.9672551Z + docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:12:37.2194426Z f6d216893d65c7b8ae43df4daaf247db808378e9: Pulling from pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9 2024-08-20T22:12:37.2197733Z 63e9bbe32327: Pulling fs layer 2024-08-20T22:12:37.2198286Z cfb3d849840e: Pulling fs layer 2024-08-20T22:12:37.2198899Z 968831e596a6: Pulling fs layer 2024-08-20T22:12:37.2199415Z ea310eb267ca: Pulling fs layer 2024-08-20T22:12:37.2200490Z 3af11d09e9cd: Pulling fs layer 2024-08-20T22:12:37.2201003Z ebfec18059b9: Pulling fs layer 2024-08-20T22:12:37.2201469Z 533b4aebf169: Pulling fs layer 2024-08-20T22:12:37.2201850Z 9dd75d06a091: Pulling fs layer 2024-08-20T22:12:37.2202215Z 30bfca4dd349: Pulling fs layer 2024-08-20T22:12:37.2202701Z 1b57ce94cad9: Pulling fs layer 2024-08-20T22:12:37.2203192Z 9ee6bdb31195: Pulling fs layer 2024-08-20T22:12:37.2203660Z a9f7203f6bc5: Pulling fs layer 2024-08-20T22:12:37.2204096Z 3dc3f3c4cd52: Pulling fs layer 2024-08-20T22:12:37.2204584Z 7bf8ebdbbd74: Pulling fs layer 2024-08-20T22:12:37.2205049Z e4173def2b15: Pulling fs layer 2024-08-20T22:12:37.2205539Z 9d3daead8a92: Pulling fs layer 2024-08-20T22:12:37.2206006Z 91f8b300cda0: Pulling fs layer 2024-08-20T22:12:37.2206465Z 06c119c80bda: Pulling fs layer 2024-08-20T22:12:37.2206936Z 12deefe7ec23: Pulling fs layer 2024-08-20T22:12:37.2207407Z e86f6b18e525: Pulling fs layer 2024-08-20T22:12:37.2207869Z fae3d4b5c8c3: Pulling fs layer 2024-08-20T22:12:37.2208361Z 441a9565ab06: Pulling fs layer 2024-08-20T22:12:37.2208825Z 74ea64d31b5c: Pulling fs layer 2024-08-20T22:12:37.2209276Z 781e24518826: Pulling fs layer 2024-08-20T22:12:37.2209752Z a38ba7c59bd8: Pulling fs layer 2024-08-20T22:12:37.2210219Z ebfec18059b9: Waiting 2024-08-20T22:12:37.2210655Z b62a470dc39d: Pulling fs layer 2024-08-20T22:12:37.2211031Z e94cd6ec737d: Pulling fs layer 2024-08-20T22:12:37.2211391Z 533b4aebf169: Waiting 2024-08-20T22:12:37.2211723Z d678a40efe95: Pulling fs layer 2024-08-20T22:12:37.2212097Z e3ae19135e0f: Pulling fs layer 2024-08-20T22:12:37.2212447Z 9dd75d06a091: Waiting 2024-08-20T22:12:37.2212769Z 8c9bd10f9f83: Pulling fs layer 2024-08-20T22:12:37.2213131Z 30bfca4dd349: Waiting 2024-08-20T22:12:37.2213462Z 26d0925d2367: Pulling fs layer 2024-08-20T22:12:37.2213827Z f222d352c9fd: Pulling fs layer 2024-08-20T22:12:37.2214192Z 08f43deb0c57: Pulling fs layer 2024-08-20T22:12:37.2214553Z 1b57ce94cad9: Waiting 2024-08-20T22:12:37.2214883Z b04d9f9eedf5: Pulling fs layer 2024-08-20T22:12:37.2215258Z a6f8cfbf12ea: Pulling fs layer 2024-08-20T22:12:37.2215626Z 816ff351c38c: Pulling fs layer 2024-08-20T22:12:37.2215986Z 829068a22c0d: Pulling fs layer 2024-08-20T22:12:37.2216337Z 9ee6bdb31195: Waiting 2024-08-20T22:12:37.2216667Z f2f2568ce4a4: Pulling fs layer 2024-08-20T22:12:37.2217042Z 4724466097d4: Pulling fs layer 2024-08-20T22:12:37.2217460Z 87447c3113eb: Pulling fs layer 2024-08-20T22:12:37.2217961Z 4c0385306092: Pulling fs layer 2024-08-20T22:12:37.2218573Z a9f7203f6bc5: Waiting 2024-08-20T22:12:37.2218904Z 7bf8ebdbbd74: Waiting 2024-08-20T22:12:37.2219240Z b2c2ed7cb6f8: Pulling fs layer 2024-08-20T22:12:37.2219605Z 91f8b300cda0: Waiting 2024-08-20T22:12:37.2219918Z ea310eb267ca: Waiting 2024-08-20T22:12:37.2220236Z e4173def2b15: Waiting 2024-08-20T22:12:37.2220563Z 12deefe7ec23: Waiting 2024-08-20T22:12:37.2220902Z 4f4fb700ef54: Pulling fs layer 2024-08-20T22:12:37.2221268Z 06c119c80bda: Waiting 2024-08-20T22:12:37.2221595Z 3dc3f3c4cd52: Waiting 2024-08-20T22:12:37.2221915Z e86f6b18e525: Waiting 2024-08-20T22:12:37.2222246Z 5b02284802e2: Pulling fs layer 2024-08-20T22:12:37.2222657Z 9d3daead8a92: Waiting 2024-08-20T22:12:37.2223027Z f2f2568ce4a4: Waiting 2024-08-20T22:12:37.2223367Z 008d94b63852: Pulling fs layer 2024-08-20T22:12:37.2223765Z 4724466097d4: Waiting 2024-08-20T22:12:37.2224095Z c89112952ad8: Pulling fs layer 2024-08-20T22:12:37.2224456Z e94cd6ec737d: Waiting 2024-08-20T22:12:37.2224783Z a6f8cfbf12ea: Waiting 2024-08-20T22:12:37.2225124Z cd13d939f03f: Pulling fs layer 2024-08-20T22:12:37.2225555Z ec89139a4f20: Pulling fs layer 2024-08-20T22:12:37.2225925Z fae3d4b5c8c3: Waiting 2024-08-20T22:12:37.2226257Z 9a7c32b81594: Pulling fs layer 2024-08-20T22:12:37.2226616Z 4c0385306092: Waiting 2024-08-20T22:12:37.2226924Z b2c2ed7cb6f8: Waiting 2024-08-20T22:12:37.2227320Z 441a9565ab06: Waiting 2024-08-20T22:12:37.2227732Z f8e09cd4393b: Pulling fs layer 2024-08-20T22:12:37.2228105Z d37c4b502304: Pulling fs layer 2024-08-20T22:12:37.2228455Z 781e24518826: Waiting 2024-08-20T22:12:37.2228901Z 74ea64d31b5c: Waiting 2024-08-20T22:12:37.2229233Z 9c1a57a5c5dd: Pulling fs layer 2024-08-20T22:12:37.2229594Z b04d9f9eedf5: Waiting 2024-08-20T22:12:37.2229912Z a38ba7c59bd8: Waiting 2024-08-20T22:12:37.2230221Z ec89139a4f20: Waiting 2024-08-20T22:12:37.2230536Z b62a470dc39d: Waiting 2024-08-20T22:12:37.2230851Z 9a7c32b81594: Waiting 2024-08-20T22:12:37.2231166Z 87447c3113eb: Waiting 2024-08-20T22:12:37.2231491Z d37c4b502304: Waiting 2024-08-20T22:12:37.2231834Z 5bb16aa8a1a3: Pulling fs layer 2024-08-20T22:12:37.2232196Z 26d0925d2367: Waiting 2024-08-20T22:12:37.2232514Z 9c1a57a5c5dd: Waiting 2024-08-20T22:12:37.2232850Z 573ff564cd64: Pulling fs layer 2024-08-20T22:12:37.2233220Z 510b8a72136f: Pulling fs layer 2024-08-20T22:12:37.2233595Z 04cfd53511b8: Pulling fs layer 2024-08-20T22:12:37.2233976Z 9e1581e2c0ea: Pulling fs layer 2024-08-20T22:12:37.2234350Z 8d7e63c166b2: Pulling fs layer 2024-08-20T22:12:37.2234801Z cd13d939f03f: Waiting 2024-08-20T22:12:37.2235141Z 39801cea2f0a: Pulling fs layer 2024-08-20T22:12:37.2235515Z 7f9ad806fd90: Pulling fs layer 2024-08-20T22:12:37.2235884Z 3c03760db75f: Pulling fs layer 2024-08-20T22:12:37.2236257Z 2ba7af7a62d5: Pulling fs layer 2024-08-20T22:12:37.2236618Z 57e34ad7d122: Pulling fs layer 2024-08-20T22:12:37.2236969Z 829068a22c0d: Waiting 2024-08-20T22:12:37.2237292Z b90d6140b3b9: Pulling fs layer 2024-08-20T22:12:37.2237653Z d5363180761d: Pulling fs layer 2024-08-20T22:12:37.2238004Z f8e09cd4393b: Waiting 2024-08-20T22:12:37.2238322Z 8d7e63c166b2: Waiting 2024-08-20T22:12:37.2238644Z f79f8c85c573: Pulling fs layer 2024-08-20T22:12:37.2239009Z 7eeb12acd973: Pulling fs layer 2024-08-20T22:12:37.2239367Z d678a40efe95: Waiting 2024-08-20T22:12:37.2239772Z 573ff564cd64: Waiting 2024-08-20T22:12:37.2240098Z d960221b0fe7: Pulling fs layer 2024-08-20T22:12:37.2240456Z 39801cea2f0a: Waiting 2024-08-20T22:12:37.2240777Z 18b851371523: Pulling fs layer 2024-08-20T22:12:37.2241146Z be2e37c8b692: Pulling fs layer 2024-08-20T22:12:37.2241506Z 08f43deb0c57: Waiting 2024-08-20T22:12:37.2241817Z 2ba7af7a62d5: Waiting 2024-08-20T22:12:37.2242127Z 8c9bd10f9f83: Waiting 2024-08-20T22:12:37.2242434Z 57e34ad7d122: Waiting 2024-08-20T22:12:37.2242739Z be2e37c8b692: Waiting 2024-08-20T22:12:37.2243046Z f79f8c85c573: Waiting 2024-08-20T22:12:37.2243354Z 04cfd53511b8: Waiting 2024-08-20T22:12:37.2243653Z 008d94b63852: Waiting 2024-08-20T22:12:37.2243961Z d960221b0fe7: Waiting 2024-08-20T22:12:37.2244275Z 7eeb12acd973: Waiting 2024-08-20T22:12:37.2244682Z c89112952ad8: Waiting 2024-08-20T22:12:37.2245001Z 18b851371523: Waiting 2024-08-20T22:12:37.2245322Z 3c03760db75f: Waiting 2024-08-20T22:12:37.2245631Z b90d6140b3b9: Waiting 2024-08-20T22:12:37.2245946Z 9e1581e2c0ea: Waiting 2024-08-20T22:12:37.2246258Z 510b8a72136f: Waiting 2024-08-20T22:12:37.3513049Z cfb3d849840e: Verifying Checksum 2024-08-20T22:12:37.3513628Z cfb3d849840e: Download complete 2024-08-20T22:12:37.4246969Z ea310eb267ca: Verifying Checksum 2024-08-20T22:12:37.4247551Z ea310eb267ca: Download complete 2024-08-20T22:12:37.5141762Z 3af11d09e9cd: Verifying Checksum 2024-08-20T22:12:37.5142222Z 3af11d09e9cd: Download complete 2024-08-20T22:12:37.6094123Z 63e9bbe32327: Download complete 2024-08-20T22:12:37.6828704Z 533b4aebf169: Download complete 2024-08-20T22:12:37.7518830Z 9dd75d06a091: Download complete 2024-08-20T22:12:37.8193197Z 30bfca4dd349: Verifying Checksum 2024-08-20T22:12:37.8193793Z 30bfca4dd349: Download complete 2024-08-20T22:12:37.8436088Z 968831e596a6: Verifying Checksum 2024-08-20T22:12:37.8436654Z 968831e596a6: Download complete 2024-08-20T22:12:38.0680363Z 9ee6bdb31195: Download complete 2024-08-20T22:12:38.1667544Z a9f7203f6bc5: Verifying Checksum 2024-08-20T22:12:38.1668277Z a9f7203f6bc5: Download complete 2024-08-20T22:12:38.7831931Z 63e9bbe32327: Pull complete 2024-08-20T22:12:39.0878110Z cfb3d849840e: Pull complete 2024-08-20T22:12:40.0486831Z 968831e596a6: Pull complete 2024-08-20T22:12:40.0645164Z ea310eb267ca: Pull complete 2024-08-20T22:12:40.0807255Z 3af11d09e9cd: Pull complete 2024-08-20T22:12:40.6749582Z 3dc3f3c4cd52: Verifying Checksum 2024-08-20T22:12:40.6750073Z 3dc3f3c4cd52: Download complete 2024-08-20T22:12:40.7458558Z 7bf8ebdbbd74: Verifying Checksum 2024-08-20T22:12:40.7459041Z 7bf8ebdbbd74: Download complete 2024-08-20T22:12:40.8427314Z e4173def2b15: Download complete 2024-08-20T22:12:40.9135784Z 9d3daead8a92: Verifying Checksum 2024-08-20T22:12:40.9136328Z 9d3daead8a92: Download complete 2024-08-20T22:12:41.8868776Z 91f8b300cda0: Verifying Checksum 2024-08-20T22:12:41.8869268Z 91f8b300cda0: Download complete 2024-08-20T22:12:41.9579502Z 06c119c80bda: Download complete 2024-08-20T22:12:42.0317275Z 12deefe7ec23: Verifying Checksum 2024-08-20T22:12:42.0318381Z 12deefe7ec23: Download complete 2024-08-20T22:12:42.1179670Z e86f6b18e525: Verifying Checksum 2024-08-20T22:12:42.1180244Z e86f6b18e525: Download complete 2024-08-20T22:12:51.1883800Z ebfec18059b9: Verifying Checksum 2024-08-20T22:12:51.1884420Z ebfec18059b9: Download complete 2024-08-20T22:12:51.2649302Z 441a9565ab06: Verifying Checksum 2024-08-20T22:12:51.2649769Z 441a9565ab06: Download complete 2024-08-20T22:12:51.3536110Z 74ea64d31b5c: Download complete 2024-08-20T22:12:51.4179090Z 781e24518826: Download complete 2024-08-20T22:12:51.4988720Z a38ba7c59bd8: Verifying Checksum 2024-08-20T22:12:51.4989163Z a38ba7c59bd8: Download complete 2024-08-20T22:12:51.5689204Z b62a470dc39d: Download complete 2024-08-20T22:12:52.8469590Z e94cd6ec737d: Verifying Checksum 2024-08-20T22:12:52.8470054Z e94cd6ec737d: Download complete 2024-08-20T22:12:52.9126983Z d678a40efe95: Verifying Checksum 2024-08-20T22:12:52.9127433Z d678a40efe95: Download complete 2024-08-20T22:12:53.0096212Z e3ae19135e0f: Verifying Checksum 2024-08-20T22:12:53.0096797Z e3ae19135e0f: Download complete 2024-08-20T22:12:53.0863115Z 8c9bd10f9f83: Verifying Checksum 2024-08-20T22:12:53.0863681Z 8c9bd10f9f83: Download complete 2024-08-20T22:12:53.1601277Z 26d0925d2367: Verifying Checksum 2024-08-20T22:12:53.1601837Z 26d0925d2367: Download complete 2024-08-20T22:12:53.2307436Z f222d352c9fd: Verifying Checksum 2024-08-20T22:12:53.2307985Z f222d352c9fd: Download complete 2024-08-20T22:12:59.2357727Z 08f43deb0c57: Verifying Checksum 2024-08-20T22:12:59.2358302Z 08f43deb0c57: Download complete 2024-08-20T22:12:59.3045367Z b04d9f9eedf5: Verifying Checksum 2024-08-20T22:12:59.3045821Z b04d9f9eedf5: Download complete 2024-08-20T22:12:59.3834232Z a6f8cfbf12ea: Verifying Checksum 2024-08-20T22:12:59.3834675Z a6f8cfbf12ea: Download complete 2024-08-20T22:12:59.8734971Z 816ff351c38c: Verifying Checksum 2024-08-20T22:12:59.8735617Z 816ff351c38c: Download complete 2024-08-20T22:12:59.9484705Z 829068a22c0d: Download complete 2024-08-20T22:13:00.0443372Z f2f2568ce4a4: Verifying Checksum 2024-08-20T22:13:00.0443934Z f2f2568ce4a4: Download complete 2024-08-20T22:13:00.3027386Z 4724466097d4: Verifying Checksum 2024-08-20T22:13:00.3027846Z 4724466097d4: Download complete 2024-08-20T22:13:00.3677522Z 87447c3113eb: Download complete 2024-08-20T22:13:00.4465466Z 4c0385306092: Verifying Checksum 2024-08-20T22:13:00.4465927Z 4c0385306092: Download complete 2024-08-20T22:13:00.5209528Z b2c2ed7cb6f8: Verifying Checksum 2024-08-20T22:13:00.5210058Z b2c2ed7cb6f8: Download complete 2024-08-20T22:13:00.5331330Z 4f4fb700ef54: Verifying Checksum 2024-08-20T22:13:00.5331778Z 4f4fb700ef54: Download complete 2024-08-20T22:13:00.6060016Z 5b02284802e2: Verifying Checksum 2024-08-20T22:13:00.6060463Z 5b02284802e2: Download complete 2024-08-20T22:13:00.6884510Z 008d94b63852: Verifying Checksum 2024-08-20T22:13:00.6885157Z 008d94b63852: Download complete 2024-08-20T22:13:01.4067109Z c89112952ad8: Verifying Checksum 2024-08-20T22:13:01.4067922Z c89112952ad8: Download complete 2024-08-20T22:13:01.4827016Z cd13d939f03f: Download complete 2024-08-20T22:13:01.6029222Z ec89139a4f20: Verifying Checksum 2024-08-20T22:13:01.6029822Z ec89139a4f20: Download complete 2024-08-20T22:13:01.6814094Z 9a7c32b81594: Verifying Checksum 2024-08-20T22:13:01.6814682Z 9a7c32b81594: Download complete 2024-08-20T22:13:01.7597327Z f8e09cd4393b: Verifying Checksum 2024-08-20T22:13:01.7598229Z f8e09cd4393b: Download complete 2024-08-20T22:13:03.9814009Z 1b57ce94cad9: Verifying Checksum 2024-08-20T22:13:03.9814561Z 1b57ce94cad9: Download complete 2024-08-20T22:13:04.0680567Z 9c1a57a5c5dd: Download complete 2024-08-20T22:13:04.1447768Z 5bb16aa8a1a3: Verifying Checksum 2024-08-20T22:13:04.1448235Z 5bb16aa8a1a3: Download complete 2024-08-20T22:13:04.2182625Z 573ff564cd64: Verifying Checksum 2024-08-20T22:13:04.2183072Z 573ff564cd64: Download complete 2024-08-20T22:13:04.2852717Z 510b8a72136f: Verifying Checksum 2024-08-20T22:13:04.2853156Z 510b8a72136f: Download complete 2024-08-20T22:13:04.3636853Z 04cfd53511b8: Download complete 2024-08-20T22:13:04.5090980Z 9e1581e2c0ea: Verifying Checksum 2024-08-20T22:13:04.5091427Z 9e1581e2c0ea: Download complete 2024-08-20T22:13:04.5618762Z ebfec18059b9: Pull complete 2024-08-20T22:13:04.5881118Z 8d7e63c166b2: Verifying Checksum 2024-08-20T22:13:04.5881669Z 8d7e63c166b2: Download complete 2024-08-20T22:13:04.6611734Z 39801cea2f0a: Verifying Checksum 2024-08-20T22:13:04.6612311Z 39801cea2f0a: Download complete 2024-08-20T22:13:04.7425577Z 7f9ad806fd90: Verifying Checksum 2024-08-20T22:13:04.7426141Z 7f9ad806fd90: Download complete 2024-08-20T22:13:04.7443150Z 533b4aebf169: Pull complete 2024-08-20T22:13:04.8172900Z 3c03760db75f: Verifying Checksum 2024-08-20T22:13:04.8173434Z 3c03760db75f: Download complete 2024-08-20T22:13:04.9540799Z 9dd75d06a091: Pull complete 2024-08-20T22:13:04.9592307Z 2ba7af7a62d5: Verifying Checksum 2024-08-20T22:13:04.9592746Z 2ba7af7a62d5: Download complete 2024-08-20T22:13:05.0226436Z 57e34ad7d122: Download complete 2024-08-20T22:13:05.0883492Z 30bfca4dd349: Pull complete 2024-08-20T22:13:05.6217312Z b90d6140b3b9: Verifying Checksum 2024-08-20T22:13:05.6217906Z b90d6140b3b9: Download complete 2024-08-20T22:13:05.6944299Z d5363180761d: Download complete 2024-08-20T22:13:11.0689223Z fae3d4b5c8c3: Verifying Checksum 2024-08-20T22:13:11.0689707Z fae3d4b5c8c3: Download complete 2024-08-20T22:13:11.1457099Z 7eeb12acd973: Verifying Checksum 2024-08-20T22:13:11.1457723Z 7eeb12acd973: Download complete 2024-08-20T22:13:11.2207165Z d960221b0fe7: Verifying Checksum 2024-08-20T22:13:11.2207739Z d960221b0fe7: Download complete 2024-08-20T22:13:11.7715691Z 18b851371523: Verifying Checksum 2024-08-20T22:13:11.7716332Z 18b851371523: Download complete 2024-08-20T22:13:11.8509658Z be2e37c8b692: Verifying Checksum 2024-08-20T22:13:11.8510182Z be2e37c8b692: Download complete 2024-08-20T22:13:19.6081244Z f79f8c85c573: Download complete 2024-08-20T22:13:45.1879298Z d37c4b502304: Download complete 2024-08-20T22:14:00.0762729Z 1b57ce94cad9: Pull complete 2024-08-20T22:14:00.3052508Z 9ee6bdb31195: Pull complete 2024-08-20T22:14:00.5386423Z a9f7203f6bc5: Pull complete 2024-08-20T22:14:09.1261173Z 3dc3f3c4cd52: Pull complete 2024-08-20T22:14:09.3514027Z 7bf8ebdbbd74: Pull complete 2024-08-20T22:14:09.5696286Z e4173def2b15: Pull complete 2024-08-20T22:14:09.7934062Z 9d3daead8a92: Pull complete 2024-08-20T22:14:12.4309935Z 91f8b300cda0: Pull complete 2024-08-20T22:14:12.6624461Z 06c119c80bda: Pull complete 2024-08-20T22:14:12.8940525Z 12deefe7ec23: Pull complete 2024-08-20T22:14:13.1118148Z e86f6b18e525: Pull complete 2024-08-20T22:15:11.9947527Z fae3d4b5c8c3: Pull complete 2024-08-20T22:15:12.1810751Z 441a9565ab06: Pull complete 2024-08-20T22:15:12.3479023Z 74ea64d31b5c: Pull complete 2024-08-20T22:15:12.4534469Z 781e24518826: Pull complete 2024-08-20T22:15:12.5193115Z a38ba7c59bd8: Pull complete 2024-08-20T22:15:12.6205338Z b62a470dc39d: Pull complete 2024-08-20T22:15:15.9946386Z e94cd6ec737d: Pull complete 2024-08-20T22:15:16.1162199Z d678a40efe95: Pull complete 2024-08-20T22:15:16.2406313Z e3ae19135e0f: Pull complete 2024-08-20T22:15:16.4780272Z 8c9bd10f9f83: Pull complete 2024-08-20T22:15:16.6709805Z 26d0925d2367: Pull complete 2024-08-20T22:15:16.8947618Z f222d352c9fd: Pull complete 2024-08-20T22:15:27.3657613Z 08f43deb0c57: Pull complete 2024-08-20T22:15:27.4696465Z b04d9f9eedf5: Pull complete 2024-08-20T22:15:27.6609638Z a6f8cfbf12ea: Pull complete 2024-08-20T22:15:28.5737590Z 816ff351c38c: Pull complete 2024-08-20T22:15:28.7132297Z 829068a22c0d: Pull complete 2024-08-20T22:15:28.8578348Z f2f2568ce4a4: Pull complete 2024-08-20T22:15:29.2385780Z 4724466097d4: Pull complete 2024-08-20T22:15:29.3751074Z 87447c3113eb: Pull complete 2024-08-20T22:15:29.5868529Z 4c0385306092: Pull complete 2024-08-20T22:15:29.6372236Z b2c2ed7cb6f8: Pull complete 2024-08-20T22:15:29.7338479Z 4f4fb700ef54: Pull complete 2024-08-20T22:15:29.8500443Z 5b02284802e2: Pull complete 2024-08-20T22:15:29.9376374Z 008d94b63852: Pull complete 2024-08-20T22:15:32.5310616Z c89112952ad8: Pull complete 2024-08-20T22:15:32.7503294Z cd13d939f03f: Pull complete 2024-08-20T22:15:32.9187777Z ec89139a4f20: Pull complete 2024-08-20T22:15:33.1011800Z 9a7c32b81594: Pull complete 2024-08-20T22:15:33.3035079Z f8e09cd4393b: Pull complete 2024-08-20T22:16:55.4458738Z d37c4b502304: Pull complete 2024-08-20T22:16:55.6749531Z 9c1a57a5c5dd: Pull complete 2024-08-20T22:16:55.9020664Z 5bb16aa8a1a3: Pull complete 2024-08-20T22:16:56.3474712Z 573ff564cd64: Pull complete 2024-08-20T22:16:56.7846632Z 510b8a72136f: Pull complete 2024-08-20T22:16:57.0158660Z 04cfd53511b8: Pull complete 2024-08-20T22:16:57.3910337Z 9e1581e2c0ea: Pull complete 2024-08-20T22:16:57.4332767Z 8d7e63c166b2: Pull complete 2024-08-20T22:16:57.4487493Z 39801cea2f0a: Pull complete 2024-08-20T22:16:57.4648990Z 7f9ad806fd90: Pull complete 2024-08-20T22:16:57.4795455Z 3c03760db75f: Pull complete 2024-08-20T22:16:58.5763450Z 2ba7af7a62d5: Pull complete 2024-08-20T22:16:58.5932486Z 57e34ad7d122: Pull complete 2024-08-20T22:17:00.6406359Z b90d6140b3b9: Pull complete 2024-08-20T22:17:00.8644078Z d5363180761d: Pull complete 2024-08-20T22:17:14.1348836Z f79f8c85c573: Pull complete 2024-08-20T22:17:14.3679551Z 7eeb12acd973: Pull complete 2024-08-20T22:17:14.5994841Z d960221b0fe7: Pull complete 2024-08-20T22:17:15.4736802Z 18b851371523: Pull complete 2024-08-20T22:17:15.7068872Z be2e37c8b692: Pull complete 2024-08-20T22:17:16.7310180Z Digest: sha256:b9572efb5db3e0afffd147f045c42cead9b20264103a375915f77858de439a1d 2024-08-20T22:17:16.7790326Z Status: Downloaded newer image for 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:17:16.8053887Z 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:17:16.8157100Z ##[group]Run echo "IN_ARC_RUNNER=$([ -f /.inarc ] && echo true || echo false)" >> "$GITHUB_OUTPUT" 2024-08-20T22:17:16.8158249Z echo "IN_ARC_RUNNER=$([ -f /.inarc ] && echo true || echo false)" >> "$GITHUB_OUTPUT" 2024-08-20T22:17:16.8168333Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:17:16.8168829Z env: 2024-08-20T22:17:16.8169109Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:17:16.8169451Z ##[endgroup] 2024-08-20T22:17:16.8353154Z ##[group]Run pytorch/test-infra/.github/actions/setup-nvidia@main 2024-08-20T22:17:16.8353689Z with: 2024-08-20T22:17:16.8353971Z driver-version: 550.54.15 2024-08-20T22:17:16.8354309Z env: 2024-08-20T22:17:16.8354574Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:17:16.8354917Z ##[endgroup] 2024-08-20T22:17:16.8530048Z ##[group]Run nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482 2024-08-20T22:17:16.8530601Z with: 2024-08-20T22:17:16.8530872Z timeout_minutes: 10 2024-08-20T22:17:16.8531202Z max_attempts: 3 2024-08-20T22:17:16.8563106Z command: # Is it disgusting to have a full shell script here in this github action? Sure # But is it the best way to make it so that this action relies on nothing else? Absolutely set -eou pipefail DISTRIBUTION=$(. /etc/os-release;echo $ID$VERSION_ID) DRIVER_FN="NVIDIA-Linux-x86_64-${DRIVER_VERSION}.run" install_nvidia_docker2_amzn2() { ( set -x # Needed for yum-config-manager sudo yum install -y yum-utils if [[ "${DISTRIBUTION}" == "amzn2023" ]] ; then YUM_REPO_URL="https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo" else # Amazon Linux 2 YUM_REPO_URL="https://nvidia.github.io/nvidia-docker/${DISTRIBUTION}/nvidia-docker.repo" fi sudo yum-config-manager --add-repo "${YUM_REPO_URL}" sudo yum install -y nvidia-docker2 sudo systemctl restart docker ) } install_nvidia_docker2_ubuntu20() { ( set -x # Install nvidia-driver package if not installed status="$(dpkg-query -W --showformat='${db:Status-Status}' nvidia-docker2 2>&1)" if [ ! $? = 0 ] || [ ! "$status" = installed ]; then sudo apt-get install -y nvidia-docker2 sudo systemctl restart docker fi ) } pre_install_nvidia_driver_amzn2() { ( # Purge any nvidia driver installed from RHEL repo sudo yum remove -y nvidia-driver-latest-dkms ) } install_nvidia_driver_common() { ( # Try to gather more information about the runner and its existing NVIDIA driver if any echo "Before installing NVIDIA driver" lspci lsmod modinfo nvidia || true HAS_NVIDIA_DRIVER=0 # Check if NVIDIA driver has already been installed if [ -x "$(command -v nvidia-smi)" ]; then set +e # The driver exists, check its version next. Also check only the first GPU if there are more than one of them # so that the same driver version is not print over multiple lines INSTALLED_DRIVER_VERSION=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader --id=0) NVIDIA_SMI_STATUS=$? if [ "$NVIDIA_SMI_STATUS" -ne 0 ] && [ "$NVIDIA_SMI_STATUS" -ne 14 ]; then echo "Failed to get NVIDIA driver version ($INSTALLED_DRIVER_VERSION). Continuing" elif [ "$INSTALLED_DRIVER_VERSION" != "$DRIVER_VERSION" ]; then echo "NVIDIA driver ($INSTALLED_DRIVER_VERSION) has been installed, but we expect to have $DRIVER_VERSION instead. Continuing" else HAS_NVIDIA_DRIVER=1 echo "NVIDIA driver ($INSTALLED_DRIVER_VERSION) has already been installed. Skipping NVIDIA driver installation" fi set -e fi if [ "$HAS_NVIDIA_DRIVER" -eq 0 ]; then # CAUTION: this may need to be updated in future if [ "${DISTRIBUTION}" != ubuntu20.04 ]; then sudo yum groupinstall -y "Development Tools" # ensure our kernel install is the same as our underlying kernel, # groupinstall "Development Tools" has a habit of mismatching kernel headers sudo yum install -y "kernel-devel-uname-r == $(uname -r)" sudo modprobe backlight fi sudo curl -fsL -o /tmp/nvidia_driver "https://s3.amazonaws.com/ossci-linux/nvidia_driver/$DRIVER_FN" set +e sudo /bin/bash /tmp/nvidia_driver -s --no-drm NVIDIA_INSTALLATION_STATUS=$? RESET_GPU=0 if [ "$NVIDIA_INSTALLATION_STATUS" -ne 0 ]; then sudo cat /var/log/nvidia-installer.log # Fail to install NVIDIA driver, try to reset the GPU RESET_GPU=1 elif [ -x "$(command -v nvidia-smi)" ]; then # Check again if nvidia-smi works even if the driver installation completes successfully INSTALLED_DRIVER_VERSION=$(nvidia-smi --query-gpu=driver_version --format=csv,noheader --id=0) NVIDIA_SMI_STATUS=$? if [ "$NVIDIA_SMI_STATUS" -ne 0 ] && [ "$NVIDIA_SMI_STATUS" -ne 14 ]; then RESET_GPU=1 fi fi if [ "$RESET_GPU" -eq 1 ]; then NVIDIA_DEVICES=$(lspci -D | grep -i NVIDIA | cut -d' ' -f1) # The GPU can get stuck in a failure state if somehow the test crashs the GPU microcode. When this # happens, we'll try to reset all NVIDIA devices https://github.com/pytorch/pytorch/issues/88388 for PCI_ID in $NVIDIA_DEVICES; do DEVICE_ENABLED=$(cat /sys/bus/pci/devices/$PCI_ID/enable) echo "Reseting $PCI_ID (enabled state: $DEVICE_ENABLED)" # This requires sudo permission of course echo "1" | sudo tee /sys/bus/pci/devices/$PCI_ID/reset sleep 1 done fi sudo rm -fv /tmp/nvidia_driver set -e fi ) } post_install_nvidia_driver_common() { ( sudo modprobe nvidia || true echo "After installing NVIDIA driver" lspci lsmod modinfo nvidia || true ( set +e nvidia-smi # NB: Annoyingly, nvidia-smi command returns successfully with return code 0 even in # the case where the driver has already crashed as it still can get the driver version # and some basic information like the bus ID. However, the rest of the information # would be missing (ERR!), for example: # # +-----------------------------------------------------------------------------+ # | NVIDIA-SMI 525.89.02 Driver Version: 525.89.02 CUDA Version: 12.0 | # |-------------------------------+----------------------+----------------------+ # | GPU Name Persistence-M| Bus-Id Disp.A | Volatile Uncorr. ECC | # | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | # | | | MIG M. | # |===============================+======================+======================| # | 0 ERR! Off | 00000000:00:1E.0 Off | ERR! | # |ERR! ERR! ERR! ERR! / ERR! | 4184MiB / 23028MiB | ERR! Default | # | | | ERR! | # +-------------------------------+----------------------+----------------------+ # # +-----------------------------------------------------------------------------+ # | Processes: | # | GPU GI CI PID Type Process name GPU Memory | # | ID ID Usage | # |=============================================================================| # +-----------------------------------------------------------------------------+ # # This should be reported as a failure instead as it will guarantee to fail when # Docker tries to run with --gpus all # # So, the correct check here is to query one of the missing piece of info like # GPU name, so that the command can fail accordingly nvidia-smi --query-gpu=gpu_name --format=csv,noheader --id=0 NVIDIA_SMI_STATUS=$? # Allowable exit statuses for nvidia-smi, see: https://github.com/NVIDIA/gpu-operator/issues/285 if [ "$NVIDIA_SMI_STATUS" -eq 0 ] || [ "$NVIDIA_SMI_STATUS" -eq 14 ]; then echo "INFO: Ignoring allowed status ${NVIDIA_SMI_STATUS}" else echo "ERROR: nvidia-smi exited with unresolved status ${NVIDIA_SMI_STATUS}" exit ${NVIDIA_SMI_STATUS} fi set -e ) ) } install_nvidia_driver_amzn2() { ( set -x pre_install_nvidia_driver_amzn2 install_nvidia_driver_common post_install_nvidia_driver_common ) } install_nvidia_driver_ubuntu20() { ( set -x install_nvidia_driver_common post_install_nvidia_driver_common ) } echo "== Installing nvidia driver ${DRIVER_FN} ==" case "${DISTRIBUTION}" in amzn*) install_nvidia_driver_amzn2 ;; ubuntu20.04) install_nvidia_driver_ubuntu20 ;; *) echo "ERROR: Unknown distribution ${DISTRIBUTION}" exit 1 ;; esac # Install container toolkit based on distribution echo "== Installing nvidia container toolkit for ${DISTRIBUTION} ==" case "${DISTRIBUTION}" in amzn*) install_nvidia_docker2_amzn2 ;; ubuntu20.04) install_nvidia_docker2_ubuntu20 ;; *) echo "ERROR: Unknown distribution ${DISTRIBUTION}" exit 1 ;; esac echo "GPU_FLAG=--gpus all -e NVIDIA_DRIVER_CAPABILITIES=all" >> "${GITHUB_ENV}" # Fix https://github.com/NVIDIA/nvidia-docker/issues/1648 on runners with # more than one GPUs. This just needs to be run once. The command fails # on subsequent runs and complains that the mode is already on, but that's # ok sudo nvidia-persistenced || true # This should show persistence mode ON nvidia-smi 2024-08-20T22:17:16.8595531Z retry_wait_seconds: 10 2024-08-20T22:17:16.8595895Z polling_interval_seconds: 1 2024-08-20T22:17:16.8596256Z warning_on_retry: true 2024-08-20T22:17:16.8596599Z continue_on_error: false 2024-08-20T22:17:16.8596930Z env: 2024-08-20T22:17:16.8597199Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:17:16.8597550Z DRIVER_VERSION: 550.54.15 2024-08-20T22:17:16.8597892Z ##[endgroup] 2024-08-20T22:17:16.9402486Z == Installing nvidia driver NVIDIA-Linux-x86_64-550.54.15.run == 2024-08-20T22:17:16.9403308Z + pre_install_nvidia_driver_amzn2 2024-08-20T22:17:16.9404288Z + sudo yum remove -y nvidia-driver-latest-dkms 2024-08-20T22:17:17.4150415Z No match for argument: nvidia-driver-latest-dkms 2024-08-20T22:17:17.4151760Z No packages marked for removal. 2024-08-20T22:17:17.4213819Z Dependencies resolved. 2024-08-20T22:17:17.4223636Z Nothing to do. 2024-08-20T22:17:17.4224149Z Complete! 2024-08-20T22:17:17.5068197Z + install_nvidia_driver_common 2024-08-20T22:17:17.5075583Z + echo 'Before installing NVIDIA driver' 2024-08-20T22:17:17.5076018Z + lspci 2024-08-20T22:17:17.5076318Z Before installing NVIDIA driver 2024-08-20T22:17:17.6037834Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2024-08-20T22:17:17.6038602Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2024-08-20T22:17:17.6039516Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2024-08-20T22:17:17.6040394Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 2024-08-20T22:17:17.6041190Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller 2024-08-20T22:17:17.6042233Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2024-08-20T22:17:17.6042957Z 00:1e.0 3D controller: NVIDIA Corporation GA102GL [A10G] (rev a1) 2024-08-20T22:17:17.6043754Z 00:1f.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe SSD Controller 2024-08-20T22:17:17.6044340Z + lsmod 2024-08-20T22:17:17.6083134Z Module Size Used by 2024-08-20T22:17:17.6083563Z xt_conntrack 16384 1 2024-08-20T22:17:17.6083942Z nft_chain_nat 16384 3 2024-08-20T22:17:17.6084308Z xt_MASQUERADE 20480 1 2024-08-20T22:17:17.6084755Z nf_nat 57344 2 nft_chain_nat,xt_MASQUERADE 2024-08-20T22:17:17.6085240Z nf_conntrack_netlink 57344 0 2024-08-20T22:17:17.6088409Z nf_conntrack 184320 4 xt_conntrack,nf_nat,nf_conntrack_netlink,xt_MASQUERADE 2024-08-20T22:17:17.6089201Z nf_defrag_ipv6 24576 1 nf_conntrack 2024-08-20T22:17:17.6089772Z nf_defrag_ipv4 16384 1 nf_conntrack 2024-08-20T22:17:17.6090214Z xfrm_user 57344 1 2024-08-20T22:17:17.6090611Z xfrm_algo 16384 1 xfrm_user 2024-08-20T22:17:17.6091021Z xt_addrtype 16384 2 2024-08-20T22:17:17.6091386Z nft_compat 20480 4 2024-08-20T22:17:17.6091833Z nf_tables 307200 57 nft_compat,nft_chain_nat 2024-08-20T22:17:17.6092436Z nfnetlink 20480 4 nft_compat,nf_conntrack_netlink,nf_tables 2024-08-20T22:17:17.6092966Z br_netfilter 36864 0 2024-08-20T22:17:17.6093464Z bridge 307200 1 br_netfilter 2024-08-20T22:17:17.6093897Z stp 16384 1 bridge 2024-08-20T22:17:17.6094308Z llc 16384 2 bridge,stp 2024-08-20T22:17:17.6094711Z overlay 167936 0 2024-08-20T22:17:17.6095072Z tls 114688 0 2024-08-20T22:17:17.6095430Z nls_ascii 16384 1 2024-08-20T22:17:17.6095781Z nls_cp437 20480 1 2024-08-20T22:17:17.6096143Z sunrpc 692224 1 2024-08-20T22:17:17.6096507Z vfat 24576 1 2024-08-20T22:17:17.6096862Z fat 86016 1 vfat 2024-08-20T22:17:17.6097244Z ena 167936 0 2024-08-20T22:17:17.6097606Z ghash_clmulni_intel 16384 0 2024-08-20T22:17:17.6097975Z ptp 36864 1 ena 2024-08-20T22:17:17.6098368Z pps_core 24576 1 ptp 2024-08-20T22:17:17.6098757Z aesni_intel 393216 0 2024-08-20T22:17:17.6099107Z i8042 45056 0 2024-08-20T22:17:17.6099539Z serio 28672 3 i8042 2024-08-20T22:17:17.6099969Z crypto_simd 16384 1 aesni_intel 2024-08-20T22:17:17.6100468Z cryptd 28672 2 crypto_simd,ghash_clmulni_intel 2024-08-20T22:17:17.6100954Z button 24576 0 2024-08-20T22:17:17.6101323Z sch_fq_codel 20480 17 2024-08-20T22:17:17.6101684Z dm_mod 188416 0 2024-08-20T22:17:17.6102039Z fuse 163840 1 2024-08-20T22:17:17.6102408Z dax 45056 1 dm_mod 2024-08-20T22:17:17.6102794Z configfs 57344 1 2024-08-20T22:17:17.6103213Z loop 36864 0 2024-08-20T22:17:17.6103648Z dmi_sysfs 20480 0 2024-08-20T22:17:17.6104075Z crc32_pclmul 16384 0 2024-08-20T22:17:17.6104539Z crc32c_intel 24576 0 2024-08-20T22:17:17.6105007Z efivarfs 24576 1 2024-08-20T22:17:17.6105405Z + modinfo nvidia 2024-08-20T22:17:17.6106320Z filename: /lib/modules/6.1.94-99.176.amzn2023.x86_64/kernel/drivers/video/nvidia.ko 2024-08-20T22:17:17.6107017Z alias: char-major-195-* 2024-08-20T22:17:17.6107398Z version: 550.54.15 2024-08-20T22:17:17.6107735Z supported: external 2024-08-20T22:17:17.6108070Z license: NVIDIA 2024-08-20T22:17:17.6108439Z firmware: nvidia/550.54.15/gsp_tu10x.bin 2024-08-20T22:17:17.6108908Z firmware: nvidia/550.54.15/gsp_ga10x.bin 2024-08-20T22:17:17.6109354Z srcversion: 833721318DA517F0C2FEC97 2024-08-20T22:17:17.6109997Z alias: pci:v000010DEd*sv*sd*bc06sc80i00* 2024-08-20T22:17:17.6110480Z alias: pci:v000010DEd*sv*sd*bc03sc02i00* 2024-08-20T22:17:17.6110967Z alias: pci:v000010DEd*sv*sd*bc03sc00i00* 2024-08-20T22:17:17.6111459Z depends: i2c-core,drm 2024-08-20T22:17:17.6111812Z retpoline: Y 2024-08-20T22:17:17.6112119Z name: nvidia 2024-08-20T22:17:17.6112814Z vermagic: 6.1.94-99.176.amzn2023.x86_64 SMP preempt mod_unload modversions 2024-08-20T22:17:17.6113500Z parm: NvSwitchRegDwords:NvSwitch regkey (charp) 2024-08-20T22:17:17.6114136Z parm: NvSwitchBlacklist:NvSwitchBlacklist=uuid[,uuid...] (charp) 2024-08-20T22:17:17.6114733Z parm: NVreg_ResmanDebugLevel:int 2024-08-20T22:17:17.6115195Z parm: NVreg_RmLogonRC:int 2024-08-20T22:17:17.6115621Z parm: NVreg_ModifyDeviceFiles:int 2024-08-20T22:17:17.6116076Z parm: NVreg_DeviceFileUID:int 2024-08-20T22:17:17.6116511Z parm: NVreg_DeviceFileGID:int 2024-08-20T22:17:17.6116948Z parm: NVreg_DeviceFileMode:int 2024-08-20T22:17:17.6117508Z parm: NVreg_InitializeSystemMemoryAllocations:int 2024-08-20T22:17:17.6118095Z parm: NVreg_UsePageAttributeTable:int 2024-08-20T22:17:17.6118587Z parm: NVreg_EnablePCIeGen3:int 2024-08-20T22:17:17.6119020Z parm: NVreg_EnableMSI:int 2024-08-20T22:17:17.6119433Z parm: NVreg_TCEBypassMode:int 2024-08-20T22:17:17.6119986Z parm: NVreg_EnableStreamMemOPs:int 2024-08-20T22:17:17.6120514Z parm: NVreg_RestrictProfilingToAdminUsers:int 2024-08-20T22:17:17.6121080Z parm: NVreg_PreserveVideoMemoryAllocations:int 2024-08-20T22:17:17.6121632Z parm: NVreg_EnableS0ixPowerManagement:int 2024-08-20T22:17:17.6122235Z parm: NVreg_S0ixPowerManagementVideoMemoryThreshold:int 2024-08-20T22:17:17.6122817Z parm: NVreg_DynamicPowerManagement:int 2024-08-20T22:17:17.6123422Z parm: NVreg_DynamicPowerManagementVideoMemoryThreshold:int 2024-08-20T22:17:17.6124016Z parm: NVreg_EnableGpuFirmware:int 2024-08-20T22:17:17.6124496Z parm: NVreg_EnableGpuFirmwareLogs:int 2024-08-20T22:17:17.6125023Z parm: NVreg_OpenRmEnableUnsupportedGpus:int 2024-08-20T22:17:17.6125561Z parm: NVreg_EnableUserNUMAManagement:int 2024-08-20T22:17:17.6126048Z parm: NVreg_MemoryPoolSize:int 2024-08-20T22:17:17.6126513Z parm: NVreg_KMallocHeapMaxSize:int 2024-08-20T22:17:17.6126990Z parm: NVreg_VMallocHeapMaxSize:int 2024-08-20T22:17:17.6127452Z parm: NVreg_IgnoreMMIOCheck:int 2024-08-20T22:17:17.6127895Z parm: NVreg_NvLinkDisable:int 2024-08-20T22:17:17.6128389Z parm: NVreg_EnablePCIERelaxedOrderingMode:int 2024-08-20T22:17:17.6128899Z parm: NVreg_RegisterPCIDriver:int 2024-08-20T22:17:17.6129369Z parm: NVreg_EnableResizableBar:int 2024-08-20T22:17:17.6129850Z parm: NVreg_EnableDbgBreakpoint:int 2024-08-20T22:17:17.6130347Z parm: NVreg_EnableNonblockingOpen:int 2024-08-20T22:17:17.6130825Z parm: NVreg_RegistryDwords:charp 2024-08-20T22:17:17.6131322Z parm: NVreg_RegistryDwordsPerDevice:charp 2024-08-20T22:17:17.6131800Z parm: NVreg_RmMsg:charp 2024-08-20T22:17:17.6132207Z parm: NVreg_GpuBlacklist:charp 2024-08-20T22:17:17.6132684Z parm: NVreg_TemporaryFilePath:charp 2024-08-20T22:17:17.6133158Z parm: NVreg_ExcludedGpus:charp 2024-08-20T22:17:17.6133603Z parm: NVreg_DmaRemapPeerMmio:int 2024-08-20T22:17:17.6134075Z parm: NVreg_RmNvlinkBandwidth:charp 2024-08-20T22:17:17.6134547Z parm: NVreg_ImexChannelCount:int 2024-08-20T22:17:17.6134988Z parm: rm_firmware_active:charp 2024-08-20T22:17:17.6135399Z + HAS_NVIDIA_DRIVER=0 2024-08-20T22:17:17.6135784Z ++ command -v nvidia-smi 2024-08-20T22:17:17.6136180Z + '[' -x /usr/bin/nvidia-smi ']' 2024-08-20T22:17:17.6136550Z + set +e 2024-08-20T22:17:17.6137152Z ++ nvidia-smi --query-gpu=driver_version --format=csv,noheader --id=0 2024-08-20T22:17:19.8854237Z + INSTALLED_DRIVER_VERSION=550.54.15 2024-08-20T22:17:19.8854854Z + NVIDIA_SMI_STATUS=0 2024-08-20T22:17:19.8855534Z + '[' 0 -ne 0 ']' 2024-08-20T22:17:19.8856018Z + '[' 550.54.15 '!=' 550.54.15 ']' 2024-08-20T22:17:19.8856568Z + HAS_NVIDIA_DRIVER=1 2024-08-20T22:17:19.8858045Z + echo 'NVIDIA driver (550.54.15) has already been installed. Skipping NVIDIA driver installation' 2024-08-20T22:17:19.8858800Z + set -e 2024-08-20T22:17:19.8859127Z + '[' 1 -eq 0 ']' 2024-08-20T22:17:19.8859816Z NVIDIA driver (550.54.15) has already been installed. Skipping NVIDIA driver installation 2024-08-20T22:17:19.8860740Z + post_install_nvidia_driver_common 2024-08-20T22:17:19.8861155Z + sudo modprobe nvidia 2024-08-20T22:17:19.9669113Z + echo 'After installing NVIDIA driver' 2024-08-20T22:17:19.9669621Z + lspci 2024-08-20T22:17:19.9669950Z After installing NVIDIA driver 2024-08-20T22:17:19.9776518Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2024-08-20T22:17:19.9777494Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2024-08-20T22:17:19.9778740Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2024-08-20T22:17:19.9779843Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 2024-08-20T22:17:19.9780960Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller 2024-08-20T22:17:19.9782078Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2024-08-20T22:17:19.9783077Z 00:1e.0 3D controller: NVIDIA Corporation GA102GL [A10G] (rev a1) 2024-08-20T22:17:19.9784216Z 00:1f.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe SSD Controller 2024-08-20T22:17:19.9784805Z + lsmod 2024-08-20T22:17:19.9808173Z Module Size Used by 2024-08-20T22:17:19.9808586Z nvidia_uvm 4706304 0 2024-08-20T22:17:19.9809062Z nvidia 54071296 1 nvidia_uvm 2024-08-20T22:17:19.9809592Z drm 602112 1 nvidia 2024-08-20T22:17:19.9810084Z drm_panel_orientation_quirks 28672 1 drm 2024-08-20T22:17:19.9810600Z backlight 24576 1 drm 2024-08-20T22:17:19.9811146Z i2c_core 106496 2 nvidia,drm 2024-08-20T22:17:19.9811697Z xt_conntrack 16384 1 2024-08-20T22:17:19.9812177Z nft_chain_nat 16384 3 2024-08-20T22:17:19.9812680Z xt_MASQUERADE 20480 1 2024-08-20T22:17:19.9813156Z nf_nat 57344 2 nft_chain_nat,xt_MASQUERADE 2024-08-20T22:17:19.9813641Z nf_conntrack_netlink 57344 0 2024-08-20T22:17:19.9814208Z nf_conntrack 184320 4 xt_conntrack,nf_nat,nf_conntrack_netlink,xt_MASQUERADE 2024-08-20T22:17:19.9814836Z nf_defrag_ipv6 24576 1 nf_conntrack 2024-08-20T22:17:19.9815300Z nf_defrag_ipv4 16384 1 nf_conntrack 2024-08-20T22:17:19.9815714Z xfrm_user 57344 1 2024-08-20T22:17:19.9816101Z xfrm_algo 16384 1 xfrm_user 2024-08-20T22:17:19.9816518Z xt_addrtype 16384 2 2024-08-20T22:17:19.9816883Z nft_compat 20480 4 2024-08-20T22:17:19.9817328Z nf_tables 307200 57 nft_compat,nft_chain_nat 2024-08-20T22:17:19.9817926Z nfnetlink 20480 4 nft_compat,nf_conntrack_netlink,nf_tables 2024-08-20T22:17:19.9818451Z br_netfilter 36864 0 2024-08-20T22:17:19.9818859Z bridge 307200 1 br_netfilter 2024-08-20T22:17:19.9819299Z stp 16384 1 bridge 2024-08-20T22:17:19.9819704Z llc 16384 2 bridge,stp 2024-08-20T22:17:19.9820118Z overlay 167936 0 2024-08-20T22:17:19.9820476Z tls 114688 0 2024-08-20T22:17:19.9820826Z nls_ascii 16384 1 2024-08-20T22:17:19.9821188Z nls_cp437 20480 1 2024-08-20T22:17:19.9821559Z sunrpc 692224 1 2024-08-20T22:17:19.9821915Z vfat 24576 1 2024-08-20T22:17:19.9822283Z fat 86016 1 vfat 2024-08-20T22:17:19.9822668Z ena 167936 0 2024-08-20T22:17:19.9823334Z ghash_clmulni_intel 16384 0 2024-08-20T22:17:19.9823722Z ptp 36864 1 ena 2024-08-20T22:17:19.9824115Z pps_core 24576 1 ptp 2024-08-20T22:17:19.9824498Z aesni_intel 393216 0 2024-08-20T22:17:19.9824860Z i8042 45056 0 2024-08-20T22:17:19.9825227Z serio 28672 3 i8042 2024-08-20T22:17:19.9825902Z crypto_simd 16384 1 aesni_intel 2024-08-20T22:17:19.9826593Z cryptd 28672 2 crypto_simd,ghash_clmulni_intel 2024-08-20T22:17:19.9827231Z button 24576 0 2024-08-20T22:17:19.9827698Z sch_fq_codel 20480 17 2024-08-20T22:17:19.9828130Z dm_mod 188416 0 2024-08-20T22:17:19.9828496Z fuse 163840 1 2024-08-20T22:17:19.9828936Z dax 45056 1 dm_mod 2024-08-20T22:17:19.9829378Z configfs 57344 1 2024-08-20T22:17:19.9829848Z loop 36864 0 2024-08-20T22:17:19.9830228Z dmi_sysfs 20480 0 2024-08-20T22:17:19.9830597Z crc32_pclmul 16384 0 2024-08-20T22:17:19.9830966Z crc32c_intel 24576 0 2024-08-20T22:17:19.9831325Z efivarfs 24576 1 2024-08-20T22:17:19.9831692Z + modinfo nvidia 2024-08-20T22:17:19.9832406Z filename: /lib/modules/6.1.94-99.176.amzn2023.x86_64/kernel/drivers/video/nvidia.ko 2024-08-20T22:17:19.9833108Z alias: char-major-195-* 2024-08-20T22:17:19.9833481Z version: 550.54.15 2024-08-20T22:17:19.9833826Z supported: external 2024-08-20T22:17:19.9834164Z license: NVIDIA 2024-08-20T22:17:19.9834541Z firmware: nvidia/550.54.15/gsp_tu10x.bin 2024-08-20T22:17:19.9835020Z firmware: nvidia/550.54.15/gsp_ga10x.bin 2024-08-20T22:17:19.9835473Z srcversion: 833721318DA517F0C2FEC97 2024-08-20T22:17:19.9835928Z alias: pci:v000010DEd*sv*sd*bc06sc80i00* 2024-08-20T22:17:19.9836419Z alias: pci:v000010DEd*sv*sd*bc03sc02i00* 2024-08-20T22:17:19.9836899Z alias: pci:v000010DEd*sv*sd*bc03sc00i00* 2024-08-20T22:17:19.9837391Z depends: i2c-core,drm 2024-08-20T22:17:19.9837750Z retpoline: Y 2024-08-20T22:17:19.9838056Z name: nvidia 2024-08-20T22:17:19.9838659Z vermagic: 6.1.94-99.176.amzn2023.x86_64 SMP preempt mod_unload modversions 2024-08-20T22:17:19.9839351Z parm: NvSwitchRegDwords:NvSwitch regkey (charp) 2024-08-20T22:17:19.9840076Z parm: NvSwitchBlacklist:NvSwitchBlacklist=uuid[,uuid...] (charp) 2024-08-20T22:17:19.9840665Z parm: NVreg_ResmanDebugLevel:int 2024-08-20T22:17:19.9841112Z parm: NVreg_RmLogonRC:int 2024-08-20T22:17:19.9841545Z parm: NVreg_ModifyDeviceFiles:int 2024-08-20T22:17:19.9841994Z parm: NVreg_DeviceFileUID:int 2024-08-20T22:17:19.9842421Z parm: NVreg_DeviceFileGID:int 2024-08-20T22:17:19.9842850Z parm: NVreg_DeviceFileMode:int 2024-08-20T22:17:19.9843366Z parm: NVreg_InitializeSystemMemoryAllocations:int 2024-08-20T22:17:19.9843920Z parm: NVreg_UsePageAttributeTable:int 2024-08-20T22:17:19.9844390Z parm: NVreg_EnablePCIeGen3:int 2024-08-20T22:17:19.9844807Z parm: NVreg_EnableMSI:int 2024-08-20T22:17:19.9845214Z parm: NVreg_TCEBypassMode:int 2024-08-20T22:17:19.9845662Z parm: NVreg_EnableStreamMemOPs:int 2024-08-20T22:17:19.9846186Z parm: NVreg_RestrictProfilingToAdminUsers:int 2024-08-20T22:17:19.9846749Z parm: NVreg_PreserveVideoMemoryAllocations:int 2024-08-20T22:17:19.9847298Z parm: NVreg_EnableS0ixPowerManagement:int 2024-08-20T22:17:19.9847898Z parm: NVreg_S0ixPowerManagementVideoMemoryThreshold:int 2024-08-20T22:17:19.9848475Z parm: NVreg_DynamicPowerManagement:int 2024-08-20T22:17:19.9849075Z parm: NVreg_DynamicPowerManagementVideoMemoryThreshold:int 2024-08-20T22:17:19.9849658Z parm: NVreg_EnableGpuFirmware:int 2024-08-20T22:17:19.9850132Z parm: NVreg_EnableGpuFirmwareLogs:int 2024-08-20T22:17:19.9850798Z parm: NVreg_OpenRmEnableUnsupportedGpus:int 2024-08-20T22:17:19.9851329Z parm: NVreg_EnableUserNUMAManagement:int 2024-08-20T22:17:19.9851812Z parm: NVreg_MemoryPoolSize:int 2024-08-20T22:17:19.9852264Z parm: NVreg_KMallocHeapMaxSize:int 2024-08-20T22:17:19.9852747Z parm: NVreg_VMallocHeapMaxSize:int 2024-08-20T22:17:19.9853200Z parm: NVreg_IgnoreMMIOCheck:int 2024-08-20T22:17:19.9853725Z parm: NVreg_NvLinkDisable:int 2024-08-20T22:17:19.9854230Z parm: NVreg_EnablePCIERelaxedOrderingMode:int 2024-08-20T22:17:19.9854744Z parm: NVreg_RegisterPCIDriver:int 2024-08-20T22:17:19.9855220Z parm: NVreg_EnableResizableBar:int 2024-08-20T22:17:19.9855703Z parm: NVreg_EnableDbgBreakpoint:int 2024-08-20T22:17:19.9856191Z parm: NVreg_EnableNonblockingOpen:int 2024-08-20T22:17:19.9856675Z parm: NVreg_RegistryDwords:charp 2024-08-20T22:17:19.9857171Z parm: NVreg_RegistryDwordsPerDevice:charp 2024-08-20T22:17:19.9857648Z parm: NVreg_RmMsg:charp 2024-08-20T22:17:19.9858072Z parm: NVreg_GpuBlacklist:charp 2024-08-20T22:17:19.9858581Z parm: NVreg_TemporaryFilePath:charp 2024-08-20T22:17:19.9859039Z parm: NVreg_ExcludedGpus:charp 2024-08-20T22:17:19.9859490Z parm: NVreg_DmaRemapPeerMmio:int 2024-08-20T22:17:19.9859973Z parm: NVreg_RmNvlinkBandwidth:charp 2024-08-20T22:17:19.9860437Z parm: NVreg_ImexChannelCount:int 2024-08-20T22:17:19.9860885Z parm: rm_firmware_active:charp 2024-08-20T22:17:19.9861286Z + set +e 2024-08-20T22:17:19.9861594Z + nvidia-smi 2024-08-20T22:17:21.5555067Z Tue Aug 20 22:17:21 2024 2024-08-20T22:17:21.5555926Z +-----------------------------------------------------------------------------------------+ 2024-08-20T22:17:21.5556792Z | NVIDIA-SMI 550.54.15 Driver Version: 550.54.15 CUDA Version: 12.4 | 2024-08-20T22:17:21.5557571Z |-----------------------------------------+------------------------+----------------------+ 2024-08-20T22:17:21.5558375Z | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | 2024-08-20T22:17:21.5559305Z | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | 2024-08-20T22:17:21.5560048Z | | | MIG M. | 2024-08-20T22:17:21.5560601Z |=========================================+========================+======================| 2024-08-20T22:17:21.5654528Z | 0 NVIDIA A10G Off | 00000000:00:1E.0 Off | 0 | 2024-08-20T22:17:21.5655241Z | 0% 29C P0 57W / 300W | 0MiB / 23028MiB | 5% Default | 2024-08-20T22:17:21.5655869Z | | | N/A | 2024-08-20T22:17:21.5656566Z +-----------------------------------------+------------------------+----------------------+ 2024-08-20T22:17:21.5657265Z 2024-08-20T22:17:21.5657927Z +-----------------------------------------------------------------------------------------+ 2024-08-20T22:17:21.5658550Z | Processes: | 2024-08-20T22:17:21.5659239Z | GPU GI CI PID Type Process name GPU Memory | 2024-08-20T22:17:21.5659919Z | ID ID Usage | 2024-08-20T22:17:21.5660476Z |=========================================================================================| 2024-08-20T22:17:21.5661124Z | No running processes found | 2024-08-20T22:17:21.5661864Z +-----------------------------------------------------------------------------------------+ 2024-08-20T22:17:22.1869992Z + nvidia-smi --query-gpu=gpu_name --format=csv,noheader --id=0 2024-08-20T22:17:23.7585649Z NVIDIA A10G 2024-08-20T22:17:24.2055769Z + NVIDIA_SMI_STATUS=0 2024-08-20T22:17:24.2056858Z + '[' 0 -eq 0 ']' 2024-08-20T22:17:24.2057269Z + echo 'INFO: Ignoring allowed status 0' 2024-08-20T22:17:24.2057701Z + set -e 2024-08-20T22:17:24.2057993Z INFO: Ignoring allowed status 0 2024-08-20T22:17:24.2064176Z == Installing nvidia container toolkit for amzn2023 == 2024-08-20T22:17:24.2068300Z + sudo yum install -y yum-utils 2024-08-20T22:17:24.5933512Z Last metadata expiration check: 0:08:20 ago on Tue Aug 20 22:09:04 2024. 2024-08-20T22:17:24.6158309Z Package dnf-utils-4.3.0-13.amzn2023.0.4.noarch is already installed. 2024-08-20T22:17:24.6474593Z Dependencies resolved. 2024-08-20T22:17:24.6611162Z Nothing to do. 2024-08-20T22:17:24.6611610Z Complete! 2024-08-20T22:17:24.7541429Z + [[ amzn2023 == \a\m\z\n\2\0\2\3 ]] 2024-08-20T22:17:24.7542706Z + YUM_REPO_URL=https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo 2024-08-20T22:17:24.7544030Z + sudo yum-config-manager --add-repo https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo 2024-08-20T22:17:25.0177754Z Adding repo from: https://nvidia.github.io/libnvidia-container/stable/rpm/nvidia-container-toolkit.repo 2024-08-20T22:17:25.0882240Z + sudo yum install -y nvidia-docker2 2024-08-20T22:17:25.5891053Z nvidia-container-toolkit 11 kB/s | 833 B 00:00 2024-08-20T22:17:25.6112043Z Package nvidia-docker2-2.14.0-1.noarch is already installed. 2024-08-20T22:17:25.6421568Z Dependencies resolved. 2024-08-20T22:17:25.6559249Z Nothing to do. 2024-08-20T22:17:25.6559808Z Complete! 2024-08-20T22:17:25.7484431Z + sudo systemctl restart docker 2024-08-20T22:18:08.3753379Z Tue Aug 20 22:18:08 2024 2024-08-20T22:18:08.3755931Z +-----------------------------------------------------------------------------------------+ 2024-08-20T22:18:08.3756882Z | NVIDIA-SMI 550.54.15 Driver Version: 550.54.15 CUDA Version: 12.4 | 2024-08-20T22:18:08.3757693Z |-----------------------------------------+------------------------+----------------------+ 2024-08-20T22:18:08.3758712Z | GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC | 2024-08-20T22:18:08.3760008Z | Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. | 2024-08-20T22:18:08.3760831Z | | | MIG M. | 2024-08-20T22:18:08.3761364Z |=========================================+========================+======================| 2024-08-20T22:18:08.3882322Z | 0 NVIDIA A10G On | 00000000:00:1E.0 Off | 0 | 2024-08-20T22:18:08.3883135Z | 0% 29C P0 58W / 300W | 0MiB / 23028MiB | 5% Default | 2024-08-20T22:18:08.3883766Z | | | N/A | 2024-08-20T22:18:08.3884586Z +-----------------------------------------+------------------------+----------------------+ 2024-08-20T22:18:08.3885169Z 2024-08-20T22:18:08.3885813Z +-----------------------------------------------------------------------------------------+ 2024-08-20T22:18:08.3886432Z | Processes: | 2024-08-20T22:18:08.3887153Z | GPU GI CI PID Type Process name GPU Memory | 2024-08-20T22:18:08.3888053Z | ID ID Usage | 2024-08-20T22:18:08.3888694Z |=========================================================================================| 2024-08-20T22:18:08.3889309Z | No running processes found | 2024-08-20T22:18:08.3890072Z +-----------------------------------------------------------------------------------------+ 2024-08-20T22:18:08.9656183Z Command completed after 1 attempt(s). 2024-08-20T22:18:08.9735579Z ##[group]Run python3 -m pip install psutil==5.9.1 nvidia-ml-py==11.525.84 2024-08-20T22:18:08.9736333Z python3 -m pip install psutil==5.9.1 nvidia-ml-py==11.525.84 2024-08-20T22:18:08.9737002Z python3 -m tools.stats.monitor > usage_log.txt 2>&1 & 2024-08-20T22:18:08.9737631Z echo "monitor-script-pid=${!}" >> "${GITHUB_OUTPUT}" 2024-08-20T22:18:08.9752475Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:08.9752972Z env: 2024-08-20T22:18:08.9753253Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:08.9753698Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:08.9754176Z ##[endgroup] 2024-08-20T22:18:09.2563945Z Defaulting to user installation because normal site-packages is not writeable 2024-08-20T22:18:09.6794562Z Collecting psutil==5.9.1 2024-08-20T22:18:09.7145733Z Downloading psutil-5.9.1-cp39-cp39-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (281 kB) 2024-08-20T22:18:09.7642798Z Collecting nvidia-ml-py==11.525.84 2024-08-20T22:18:09.7680814Z Downloading nvidia_ml_py-11.525.84-py3-none-any.whl (34 kB) 2024-08-20T22:18:09.8536413Z Installing collected packages: psutil, nvidia-ml-py 2024-08-20T22:18:10.0109464Z Successfully installed nvidia-ml-py-11.525.84 psutil-5.9.1 2024-08-20T22:18:10.2057040Z Prepare all required actions 2024-08-20T22:18:10.2057535Z Getting action download info 2024-08-20T22:18:10.3256872Z Download action repository 'seemethere/download-artifact-s3@v4' (SHA:1da556a7aa0a088e3153970611f6c432d58e80e6) 2024-08-20T22:18:10.5070377Z Download action repository 'actions/download-artifact@v3' (SHA:9bc31d5ccc31df68ecc42ccf4149144866c47d8a) 2024-08-20T22:18:10.6531925Z ##[group]Run ./.github/actions/download-build-artifacts 2024-08-20T22:18:10.6532402Z with: 2024-08-20T22:18:10.6532739Z name: linux-focal-cuda12.4-py3.10-gcc9-sm86 2024-08-20T22:18:10.6533194Z s3-bucket: gha-artifacts 2024-08-20T22:18:10.6533528Z env: 2024-08-20T22:18:10.6533802Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:10.6534237Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:10.6534712Z ##[endgroup] 2024-08-20T22:18:10.6568515Z ##[group]Run seemethere/download-artifact-s3@v4 2024-08-20T22:18:10.6568952Z with: 2024-08-20T22:18:10.6569322Z name: linux-focal-cuda12.4-py3.10-gcc9-sm86 2024-08-20T22:18:10.6569768Z s3-bucket: gha-artifacts 2024-08-20T22:18:10.6570125Z region: us-east-1 2024-08-20T22:18:10.6570421Z env: 2024-08-20T22:18:10.6570687Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:10.6571128Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:10.6571601Z ##[endgroup] 2024-08-20T22:18:11.1274133Z (node:52441) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2024-08-20T22:18:11.1275212Z 2024-08-20T22:18:11.1275602Z Please migrate your code to use AWS SDK for JavaScript (v3). 2024-08-20T22:18:11.1276737Z For more information, check the migration guide at https://a.co/7PzMCcy 2024-08-20T22:18:11.1278091Z (Use `node --trace-warnings ...` to show where the warning was created) 2024-08-20T22:18:11.1898654Z Found 1 objects with prefix pytorch/pytorch/10479310961/linux-focal-cuda12.4-py3.10-gcc9-sm86/ 2024-08-20T22:18:11.1900326Z Starting download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/artifacts.zip 2024-08-20T22:18:34.4586156Z Finished download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/artifacts.zip 2024-08-20T22:18:34.4592960Z Artifact download has finished successfully 2024-08-20T22:18:34.4947817Z ##[group]Run unzip -o artifacts.zip 2024-08-20T22:18:34.4948246Z unzip -o artifacts.zip 2024-08-20T22:18:34.4958171Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:34.4958670Z env: 2024-08-20T22:18:34.4958950Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:34.4959400Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:34.4960199Z ##[endgroup] 2024-08-20T22:18:34.5047638Z Archive: artifacts.zip 2024-08-20T22:18:34.5049175Z creating: dist/ 2024-08-20T22:18:36.6354598Z inflating: dist/torch-2.5.0a0+git40ec5f6-cp310-cp310-linux_x86_64.whl 2024-08-20T22:18:36.6356097Z creating: build/custom_test_artifacts/ 2024-08-20T22:18:36.6356820Z creating: build/custom_test_artifacts/custom-op-build/ 2024-08-20T22:18:36.6358834Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/ 2024-08-20T22:18:36.6360001Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/pkgRedirects/ 2024-08-20T22:18:36.6363010Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeConfigureLog.yaml 2024-08-20T22:18:36.6363997Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/ 2024-08-20T22:18:36.6365086Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeSystem.cmake 2024-08-20T22:18:36.6366081Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/ 2024-08-20T22:18:36.6367047Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/tmp/ 2024-08-20T22:18:36.6368683Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/CMakeCCompilerId.c 2024-08-20T22:18:36.6371038Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/a.out 2024-08-20T22:18:36.6372150Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/ 2024-08-20T22:18:36.6373145Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/tmp/ 2024-08-20T22:18:36.6374509Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/CMakeCXXCompilerId.cpp 2024-08-20T22:18:36.6376118Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/a.out 2024-08-20T22:18:36.6377971Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_C.bin 2024-08-20T22:18:36.6379321Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeCCompiler.cmake 2024-08-20T22:18:36.6380766Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CXX.bin 2024-08-20T22:18:36.6382239Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeCXXCompiler.cmake 2024-08-20T22:18:36.6383377Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/ 2024-08-20T22:18:36.6384391Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/ 2024-08-20T22:18:36.6426546Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp4.ii 2024-08-20T22:18:36.6468471Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.cpp 2024-08-20T22:18:36.6471332Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.module_id 2024-08-20T22:18:36.6516849Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp1.ii 2024-08-20T22:18:36.6518935Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.c 2024-08-20T22:18:36.6520659Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.gpu 2024-08-20T22:18:36.6522332Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.stub.c 2024-08-20T22:18:36.6523977Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.ptx 2024-08-20T22:18:36.6525618Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.sm_52.cubin 2024-08-20T22:18:36.6527484Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin 2024-08-20T22:18:36.6529003Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin.c 2024-08-20T22:18:36.6530491Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.o 2024-08-20T22:18:36.6531783Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.sm_52.cubin 2024-08-20T22:18:36.6533020Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.reg.c 2024-08-20T22:18:36.6534226Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.fatbin 2024-08-20T22:18:36.6535441Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.fatbin.c 2024-08-20T22:18:36.6536632Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.o 2024-08-20T22:18:36.6537827Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/CMakeCUDACompilerId.cu 2024-08-20T22:18:36.6604639Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCUDA/a.out 2024-08-20T22:18:36.6678491Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CUDA.bin 2024-08-20T22:18:36.6680058Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeCUDACompiler.cmake 2024-08-20T22:18:36.6681308Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeScratch/ 2024-08-20T22:18:36.6682411Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeTmp/ 2024-08-20T22:18:36.6683553Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/cmake.check_cache 2024-08-20T22:18:36.6684644Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/ 2024-08-20T22:18:36.6685863Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.ts 2024-08-20T22:18:36.6687295Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.make 2024-08-20T22:18:36.6688558Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/depend.make 2024-08-20T22:18:36.6689742Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/link.txt 2024-08-20T22:18:36.6690801Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/cmake_clean.cmake 2024-08-20T22:18:36.6691864Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/build.make 2024-08-20T22:18:36.6692934Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/DependInfo.cmake 2024-08-20T22:18:36.6694011Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/flags.make 2024-08-20T22:18:36.6695054Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/progress.make 2024-08-20T22:18:36.6710876Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o.d 2024-08-20T22:18:36.6854211Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o 2024-08-20T22:18:36.6855410Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/ 2024-08-20T22:18:36.6857629Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.ts 2024-08-20T22:18:36.6860640Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.make 2024-08-20T22:18:36.6863270Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/depend.make 2024-08-20T22:18:36.6865776Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/link.txt 2024-08-20T22:18:36.6867191Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/cmake_clean.cmake 2024-08-20T22:18:36.6868590Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/build.make 2024-08-20T22:18:36.6869708Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/DependInfo.cmake 2024-08-20T22:18:36.6870809Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/flags.make 2024-08-20T22:18:36.6871904Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/progress.make 2024-08-20T22:18:36.6884478Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o.d 2024-08-20T22:18:36.6968014Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o 2024-08-20T22:18:36.6969534Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeDirectoryInformation.cmake 2024-08-20T22:18:36.6970729Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/TargetDirectories.txt 2024-08-20T22:18:36.6972058Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/progress.marks 2024-08-20T22:18:36.6973120Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile2 2024-08-20T22:18:36.6974159Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile.cmake 2024-08-20T22:18:36.6975069Z inflating: build/custom_test_artifacts/custom-op-build/detect_cuda_version.cc 2024-08-20T22:18:36.6976758Z inflating: build/custom_test_artifacts/custom-op-build/CMakeCache.txt 2024-08-20T22:18:36.6977881Z inflating: build/custom_test_artifacts/custom-op-build/Makefile 2024-08-20T22:18:36.6979032Z inflating: build/custom_test_artifacts/custom-op-build/cmake_install.cmake 2024-08-20T22:18:36.7099251Z inflating: build/custom_test_artifacts/custom-op-build/libcustom_ops.so 2024-08-20T22:18:36.7162589Z inflating: build/custom_test_artifacts/custom-op-build/test_custom_ops 2024-08-20T22:18:36.7163402Z creating: build/custom_test_artifacts/jit-hook-build/ 2024-08-20T22:18:36.7164164Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/ 2024-08-20T22:18:36.7164980Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/pkgRedirects/ 2024-08-20T22:18:36.7172339Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeConfigureLog.yaml 2024-08-20T22:18:36.7173544Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/ 2024-08-20T22:18:36.7174635Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeSystem.cmake 2024-08-20T22:18:36.7175600Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/ 2024-08-20T22:18:36.7176696Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/tmp/ 2024-08-20T22:18:36.7177817Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/CMakeCCompilerId.c 2024-08-20T22:18:36.7179197Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/a.out 2024-08-20T22:18:36.7180275Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/ 2024-08-20T22:18:36.7181249Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/tmp/ 2024-08-20T22:18:36.7182614Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/CMakeCXXCompilerId.cpp 2024-08-20T22:18:36.7184785Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/a.out 2024-08-20T22:18:36.7186640Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_C.bin 2024-08-20T22:18:36.7188110Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeCCompiler.cmake 2024-08-20T22:18:36.7189498Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CXX.bin 2024-08-20T22:18:36.7190844Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeCXXCompiler.cmake 2024-08-20T22:18:36.7191973Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/ 2024-08-20T22:18:36.7192965Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/ 2024-08-20T22:18:36.7234902Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp4.ii 2024-08-20T22:18:36.7276360Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.cpp 2024-08-20T22:18:36.7277830Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.module_id 2024-08-20T22:18:36.7323347Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp1.ii 2024-08-20T22:18:36.7324954Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.c 2024-08-20T22:18:36.7326818Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.gpu 2024-08-20T22:18:36.7328584Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.stub.c 2024-08-20T22:18:36.7330113Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.ptx 2024-08-20T22:18:36.7331691Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.sm_52.cubin 2024-08-20T22:18:36.7333276Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin 2024-08-20T22:18:36.7334874Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin.c 2024-08-20T22:18:36.7336409Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.o 2024-08-20T22:18:36.7337804Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.sm_52.cubin 2024-08-20T22:18:36.7339020Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.reg.c 2024-08-20T22:18:36.7340212Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.fatbin 2024-08-20T22:18:36.7341403Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.fatbin.c 2024-08-20T22:18:36.7342570Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.o 2024-08-20T22:18:36.7343749Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/CMakeCUDACompilerId.cu 2024-08-20T22:18:36.7412400Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCUDA/a.out 2024-08-20T22:18:36.7486018Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CUDA.bin 2024-08-20T22:18:36.7487347Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeCUDACompiler.cmake 2024-08-20T22:18:36.7488555Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeScratch/ 2024-08-20T22:18:36.7489601Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeTmp/ 2024-08-20T22:18:36.7490663Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/cmake.check_cache 2024-08-20T22:18:36.7491997Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/ 2024-08-20T22:18:36.7493196Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.ts 2024-08-20T22:18:36.7494555Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.make 2024-08-20T22:18:36.7495925Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/depend.make 2024-08-20T22:18:36.7497146Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/link.txt 2024-08-20T22:18:36.7498233Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/cmake_clean.cmake 2024-08-20T22:18:36.7499328Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/build.make 2024-08-20T22:18:36.7500424Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/DependInfo.cmake 2024-08-20T22:18:36.7501524Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/flags.make 2024-08-20T22:18:36.7502606Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/progress.make 2024-08-20T22:18:36.7518176Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o.d 2024-08-20T22:18:36.7582738Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o 2024-08-20T22:18:36.7584211Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeDirectoryInformation.cmake 2024-08-20T22:18:36.7585332Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/TargetDirectories.txt 2024-08-20T22:18:36.7586449Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/progress.marks 2024-08-20T22:18:36.7587458Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile2 2024-08-20T22:18:36.7588524Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile.cmake 2024-08-20T22:18:36.7589412Z inflating: build/custom_test_artifacts/jit-hook-build/detect_cuda_version.cc 2024-08-20T22:18:36.7591371Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeCache.txt 2024-08-20T22:18:36.7592504Z inflating: build/custom_test_artifacts/jit-hook-build/Makefile 2024-08-20T22:18:36.7593443Z inflating: build/custom_test_artifacts/jit-hook-build/cmake_install.cmake 2024-08-20T22:18:36.7643178Z inflating: build/custom_test_artifacts/jit-hook-build/test_jit_hooks 2024-08-20T22:18:36.7643938Z creating: build/custom_test_artifacts/custom-backend-build/ 2024-08-20T22:18:36.7644699Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/ 2024-08-20T22:18:36.7645579Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/pkgRedirects/ 2024-08-20T22:18:36.7652269Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeConfigureLog.yaml 2024-08-20T22:18:36.7653251Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/ 2024-08-20T22:18:36.7654240Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeSystem.cmake 2024-08-20T22:18:36.7655271Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/ 2024-08-20T22:18:36.7656291Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/tmp/ 2024-08-20T22:18:36.7657750Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/CMakeCCompilerId.c 2024-08-20T22:18:36.7659243Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/a.out 2024-08-20T22:18:36.7660401Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/ 2024-08-20T22:18:36.7661430Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/tmp/ 2024-08-20T22:18:36.7663083Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/CMakeCXXCompilerId.cpp 2024-08-20T22:18:36.7664682Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/a.out 2024-08-20T22:18:36.7666551Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_C.bin 2024-08-20T22:18:36.7668064Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeCCompiler.cmake 2024-08-20T22:18:36.7669776Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CXX.bin 2024-08-20T22:18:36.7671193Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeCXXCompiler.cmake 2024-08-20T22:18:36.7672450Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/ 2024-08-20T22:18:36.7673493Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/ 2024-08-20T22:18:36.7714429Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp4.ii 2024-08-20T22:18:36.7756357Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.cpp 2024-08-20T22:18:36.7759729Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.module_id 2024-08-20T22:18:36.7804729Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cpp1.ii 2024-08-20T22:18:36.7806275Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.c 2024-08-20T22:18:36.7807885Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.gpu 2024-08-20T22:18:36.7809699Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.cudafe1.stub.c 2024-08-20T22:18:36.7811574Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.ptx 2024-08-20T22:18:36.7813339Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.sm_52.cubin 2024-08-20T22:18:36.7815232Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin 2024-08-20T22:18:36.7817056Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.fatbin.c 2024-08-20T22:18:36.7818843Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/CMakeCUDACompilerId.o 2024-08-20T22:18:36.7820416Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.sm_52.cubin 2024-08-20T22:18:36.7821740Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.reg.c 2024-08-20T22:18:36.7823001Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.fatbin 2024-08-20T22:18:36.7824278Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.fatbin.c 2024-08-20T22:18:36.7825504Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/tmp/a_dlink.o 2024-08-20T22:18:36.7826766Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/CMakeCUDACompilerId.cu 2024-08-20T22:18:36.7894723Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCUDA/a.out 2024-08-20T22:18:36.7968465Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CUDA.bin 2024-08-20T22:18:36.7970298Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeCUDACompiler.cmake 2024-08-20T22:18:36.7971654Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeScratch/ 2024-08-20T22:18:36.7972859Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeTmp/ 2024-08-20T22:18:36.7974115Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/cmake.check_cache 2024-08-20T22:18:36.7975376Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/ 2024-08-20T22:18:36.7976847Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.ts 2024-08-20T22:18:36.7978439Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.make 2024-08-20T22:18:36.7979848Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/depend.make 2024-08-20T22:18:36.7980995Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/link.txt 2024-08-20T22:18:36.7982160Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/cmake_clean.cmake 2024-08-20T22:18:36.7983709Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/build.make 2024-08-20T22:18:36.7984954Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/DependInfo.cmake 2024-08-20T22:18:36.7986124Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/flags.make 2024-08-20T22:18:36.7987282Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/progress.make 2024-08-20T22:18:36.7988510Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o.d 2024-08-20T22:18:36.8109964Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o 2024-08-20T22:18:36.8111474Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/ 2024-08-20T22:18:36.8113012Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.ts 2024-08-20T22:18:36.8114631Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.make 2024-08-20T22:18:36.8116300Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/depend.make 2024-08-20T22:18:36.8117771Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/link.txt 2024-08-20T22:18:36.8119156Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/cmake_clean.cmake 2024-08-20T22:18:36.8120453Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/build.make 2024-08-20T22:18:36.8121697Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/DependInfo.cmake 2024-08-20T22:18:36.8122925Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/flags.make 2024-08-20T22:18:36.8124136Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/progress.make 2024-08-20T22:18:36.8139767Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o.d 2024-08-20T22:18:36.8195810Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o 2024-08-20T22:18:36.8198966Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeDirectoryInformation.cmake 2024-08-20T22:18:36.8202023Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/TargetDirectories.txt 2024-08-20T22:18:36.8204793Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/progress.marks 2024-08-20T22:18:36.8206789Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile2 2024-08-20T22:18:36.8207805Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile.cmake 2024-08-20T22:18:36.8208762Z inflating: build/custom_test_artifacts/custom-backend-build/detect_cuda_version.cc 2024-08-20T22:18:36.8209654Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeCache.txt 2024-08-20T22:18:36.8210486Z inflating: build/custom_test_artifacts/custom-backend-build/Makefile 2024-08-20T22:18:36.8211342Z inflating: build/custom_test_artifacts/custom-backend-build/cmake_install.cmake 2024-08-20T22:18:36.8310545Z inflating: build/custom_test_artifacts/custom-backend-build/libcustom_backend.so 2024-08-20T22:18:36.8353430Z inflating: build/custom_test_artifacts/custom-backend-build/test_custom_backend 2024-08-20T22:18:36.8354206Z creating: build/lib/ 2024-08-20T22:18:36.8442373Z inflating: build/lib/libprotobuf-lite.a 2024-08-20T22:18:36.8897296Z inflating: build/lib/libprotobuf.a 2024-08-20T22:18:36.8906966Z inflating: build/lib/libpthreadpool.a 2024-08-20T22:18:36.8915286Z inflating: build/lib/libcpuinfo.a 2024-08-20T22:18:36.8923571Z inflating: build/lib/libcpuinfo_internals.a 2024-08-20T22:18:36.8924225Z inflating: build/lib/libclog.a 2024-08-20T22:18:36.8943296Z inflating: build/lib/libnnpack.a 2024-08-20T22:18:36.8945844Z inflating: build/lib/libnnpack_reference_layers.a 2024-08-20T22:18:36.9010793Z inflating: build/lib/libgtest.a 2024-08-20T22:18:36.9085720Z inflating: build/lib/libbenchmark.a 2024-08-20T22:18:36.9149375Z inflating: build/lib/libasmjit.a 2024-08-20T22:18:36.9156800Z inflating: build/lib/libittnotify.a 2024-08-20T22:18:36.9185196Z inflating: build/lib/libtensorpipe_uv.a 2024-08-20T22:18:36.9313156Z inflating: build/lib/libgloo.a 2024-08-20T22:18:36.9333453Z inflating: build/lib/libfmt.a 2024-08-20T22:18:36.9429628Z inflating: build/lib/libc10.so 2024-08-20T22:18:36.9431343Z inflating: build/lib/libcaffe2_nvrtc.so 2024-08-20T22:18:36.9432637Z inflating: build/lib/libtorch_global_deps.so 2024-08-20T22:18:36.9452058Z inflating: build/lib/libpytorch_qnnpack.a 2024-08-20T22:18:36.9470185Z inflating: build/lib/libgmock.a 2024-08-20T22:18:36.9470955Z inflating: build/lib/libgtest_main.a 2024-08-20T22:18:36.9471974Z inflating: build/lib/libbenchmark_main.a 2024-08-20T22:18:36.9975362Z inflating: build/lib/libprotoc.a 2024-08-20T22:18:37.0364220Z inflating: build/lib/libgloo_cuda.a 2024-08-20T22:18:38.0535263Z inflating: build/lib/libdnnl.a 2024-08-20T22:18:38.1109355Z inflating: build/lib/libtensorpipe.a 2024-08-20T22:18:38.1166720Z inflating: build/lib/libc10_cuda.so 2024-08-20T22:18:38.1167937Z inflating: build/lib/libgmock_main.a 2024-08-20T22:18:38.2428103Z inflating: build/lib/libfbgemm.a 2024-08-20T22:18:38.2683183Z inflating: build/lib/libtensorpipe_cuda.a 2024-08-20T22:18:38.3189097Z inflating: build/lib/libkineto.a 2024-08-20T22:18:38.3373286Z inflating: build/lib/libXNNPACK.a 2024-08-20T22:18:38.3415810Z inflating: build/lib/libonnx_proto.a 2024-08-20T22:18:38.4109006Z inflating: build/lib/libonnx.a 2024-08-20T22:18:40.8580734Z inflating: build/lib/libtorch_cpu.so 2024-08-20T22:18:40.8585703Z inflating: build/lib/libunbox_lib.a 2024-08-20T22:18:40.8589982Z inflating: build/lib/libshm.so 2024-08-20T22:18:42.9172717Z inflating: build/lib/libtorch_cuda.so 2024-08-20T22:18:42.9174145Z inflating: build/lib/libtorch.so 2024-08-20T22:18:43.7943256Z inflating: build/lib/libtorch_cuda_linalg.so 2024-08-20T22:18:43.7945783Z inflating: build/lib/libc10d_cuda_test.so 2024-08-20T22:18:43.9943450Z inflating: build/lib/libtorch_python.so 2024-08-20T22:18:44.0015639Z inflating: build/lib/libtorchbind_test.so 2024-08-20T22:18:44.0036728Z inflating: build/lib/libjitbackend_test.so 2024-08-20T22:18:44.0062632Z inflating: build/lib/libbackend_with_compiler.so 2024-08-20T22:18:44.0089284Z inflating: build/lib/libaoti_custom_ops.so 2024-08-20T22:18:44.0124378Z inflating: build/lib/libnnapi_backend.so 2024-08-20T22:18:44.0124843Z creating: build/bin/ 2024-08-20T22:18:44.0175778Z inflating: build/bin/c10_CompileTimeFunctionPointer_test 2024-08-20T22:18:44.0227689Z inflating: build/bin/c10_DeviceGuard_test 2024-08-20T22:18:44.0279332Z inflating: build/bin/c10_Device_test 2024-08-20T22:18:44.0339576Z inflating: build/bin/c10_DispatchKeySet_test 2024-08-20T22:18:44.0392978Z inflating: build/bin/c10_Scalar_test 2024-08-20T22:18:44.0443029Z inflating: build/bin/c10_StreamGuard_test 2024-08-20T22:18:44.0494614Z inflating: build/bin/c10_SymInt_test 2024-08-20T22:18:44.0550363Z inflating: build/bin/c10_InlineDeviceGuard_test 2024-08-20T22:18:44.0606722Z inflating: build/bin/c10_InlineStreamGuard_test 2024-08-20T22:18:44.0663329Z inflating: build/bin/c10_SizesAndStrides_test 2024-08-20T22:18:44.0735466Z inflating: build/bin/c10_cow_test 2024-08-20T22:18:44.0789470Z inflating: build/bin/c10_Bitset_test 2024-08-20T22:18:44.0839008Z inflating: build/bin/c10_ConstexprCrc_test 2024-08-20T22:18:44.0890186Z inflating: build/bin/c10_DeadlockDetection_test 2024-08-20T22:18:44.0941870Z inflating: build/bin/c10_Half_test 2024-08-20T22:18:44.0999008Z inflating: build/bin/c10_LeftRight_test 2024-08-20T22:18:44.1054722Z inflating: build/bin/c10_Metaprogramming_test 2024-08-20T22:18:44.1105841Z inflating: build/bin/c10_Synchronized_test 2024-08-20T22:18:44.1162192Z inflating: build/bin/c10_ThreadLocal_test 2024-08-20T22:18:44.1214513Z inflating: build/bin/c10_TypeIndex_test 2024-08-20T22:18:44.1266315Z inflating: build/bin/c10_TypeList_test 2024-08-20T22:18:44.1316136Z inflating: build/bin/c10_TypeTraits_test 2024-08-20T22:18:44.1368833Z inflating: build/bin/c10_accumulate_test 2024-08-20T22:18:44.1425205Z inflating: build/bin/c10_bfloat16_test 2024-08-20T22:18:44.1476608Z inflating: build/bin/c10_bit_cast_test 2024-08-20T22:18:44.1533803Z inflating: build/bin/c10_complex_math_test 2024-08-20T22:18:44.1590441Z inflating: build/bin/c10_complex_test 2024-08-20T22:18:44.1643815Z inflating: build/bin/c10_exception_test 2024-08-20T22:18:44.1695182Z inflating: build/bin/c10_flags_test 2024-08-20T22:18:44.1746128Z inflating: build/bin/c10_generic_math_test 2024-08-20T22:18:44.1912490Z inflating: build/bin/c10_intrusive_ptr_test 2024-08-20T22:18:44.1963913Z inflating: build/bin/c10_irange_test 2024-08-20T22:18:44.2018546Z inflating: build/bin/c10_lazy_test 2024-08-20T22:18:44.2076601Z inflating: build/bin/c10_logging_test 2024-08-20T22:18:44.2152454Z inflating: build/bin/c10_optional_test 2024-08-20T22:18:44.2216552Z inflating: build/bin/c10_ordered_preserving_dict_test 2024-08-20T22:18:44.2271499Z inflating: build/bin/c10_registry_test 2024-08-20T22:18:44.2423967Z inflating: build/bin/c10_small_vector_test 2024-08-20T22:18:44.2476515Z inflating: build/bin/c10_ssize_test 2024-08-20T22:18:44.2529266Z inflating: build/bin/c10_string_util_test 2024-08-20T22:18:44.2589078Z inflating: build/bin/c10_string_view_test 2024-08-20T22:18:44.2640088Z inflating: build/bin/c10_tempfile_test 2024-08-20T22:18:44.2689596Z inflating: build/bin/c10_intrusive_ptr_benchmark 2024-08-20T22:18:44.2746411Z inflating: build/bin/c10_typeid_test 2024-08-20T22:18:44.3195139Z inflating: build/bin/protoc-3.13.0.0 2024-08-20T22:18:44.3644048Z inflating: build/bin/protoc 2024-08-20T22:18:44.3698289Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_1_var_test 2024-08-20T22:18:44.3751900Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_catches_stream 2024-08-20T22:18:44.3805822Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_catches_thread_and_block_and_device 2024-08-20T22:18:44.3858554Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_from_2_processes 2024-08-20T22:18:44.3912620Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_multiple_writes_from_blocks_and_threads 2024-08-20T22:18:44.3966149Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_multiple_writes_from_multiple_blocks 2024-08-20T22:18:44.4016251Z inflating: build/bin/c10_cuda_CUDATest 2024-08-20T22:18:44.4070339Z inflating: build/bin/c10_cuda_CUDAAssertionsTest_multiple_writes_from_same_block 2024-08-20T22:18:44.4402126Z inflating: build/bin/vec_test_all_types_DEFAULT 2024-08-20T22:18:44.4748512Z inflating: build/bin/vec_test_all_types_AVX512 2024-08-20T22:18:44.5109312Z inflating: build/bin/vec_test_all_types_AVX2 2024-08-20T22:18:44.5162698Z inflating: build/bin/BackoffTest 2024-08-20T22:18:44.5216981Z inflating: build/bin/FileStoreTest 2024-08-20T22:18:44.5274329Z inflating: build/bin/TCPStoreTest 2024-08-20T22:18:44.5328468Z inflating: build/bin/HashStoreTest 2024-08-20T22:18:44.5342775Z inflating: build/bin/ProcessGroupMPITest 2024-08-20T22:18:44.5397670Z inflating: build/bin/test_edge_op_registration 2024-08-20T22:18:44.5402663Z inflating: build/bin/torch_shm_manager 2024-08-20T22:18:44.5406222Z inflating: build/bin/example_allreduce 2024-08-20T22:18:44.5462288Z inflating: build/bin/test_dist_autograd 2024-08-20T22:18:44.5532111Z inflating: build/bin/test_cpp_rpc 2024-08-20T22:18:44.5535329Z inflating: build/bin/parallel_benchmark 2024-08-20T22:18:44.5603798Z inflating: build/bin/test_mobile_nnc 2024-08-20T22:18:44.5613314Z inflating: build/bin/aot_model_compiler_test 2024-08-20T22:18:44.5961780Z inflating: build/bin/test_lazy 2024-08-20T22:18:44.7146672Z inflating: build/bin/test_api 2024-08-20T22:18:44.7221586Z inflating: build/bin/Dict_test 2024-08-20T22:18:44.7274534Z inflating: build/bin/Dimname_test 2024-08-20T22:18:44.7339830Z inflating: build/bin/MaybeOwned_test 2024-08-20T22:18:44.7398210Z inflating: build/bin/NamedTensor_test 2024-08-20T22:18:44.7458145Z inflating: build/bin/apply_utils_test 2024-08-20T22:18:44.7518178Z inflating: build/bin/atest 2024-08-20T22:18:44.7582465Z inflating: build/bin/basic 2024-08-20T22:18:44.7638130Z inflating: build/bin/broadcast_test 2024-08-20T22:18:44.7689976Z inflating: build/bin/cpu_allocator_test 2024-08-20T22:18:44.7748932Z inflating: build/bin/cpu_generator_test 2024-08-20T22:18:44.7803393Z inflating: build/bin/cpu_profiling_allocator_test 2024-08-20T22:18:44.7897619Z inflating: build/bin/cpu_rng_test 2024-08-20T22:18:44.7948442Z inflating: build/bin/dispatch_key_set_test 2024-08-20T22:18:44.7999942Z inflating: build/bin/dlconvertor_test 2024-08-20T22:18:44.8058886Z inflating: build/bin/extension_backend_test 2024-08-20T22:18:44.8115089Z inflating: build/bin/half_test 2024-08-20T22:18:44.8212731Z inflating: build/bin/ivalue_test 2024-08-20T22:18:44.8263156Z inflating: build/bin/lazy_tensor_test 2024-08-20T22:18:44.8318613Z inflating: build/bin/math_kernel_test 2024-08-20T22:18:44.8373641Z inflating: build/bin/memory_format_test 2024-08-20T22:18:44.8428348Z inflating: build/bin/memory_overlapping_test 2024-08-20T22:18:44.8482846Z inflating: build/bin/mobile_memory_cleanup 2024-08-20T22:18:44.8539715Z inflating: build/bin/native_test 2024-08-20T22:18:44.8591431Z inflating: build/bin/operator_name_test 2024-08-20T22:18:44.8643806Z inflating: build/bin/operators_test 2024-08-20T22:18:44.8696536Z inflating: build/bin/packedtensoraccessor_test 2024-08-20T22:18:44.8764530Z inflating: build/bin/pow_test 2024-08-20T22:18:44.8822812Z inflating: build/bin/quantized_test 2024-08-20T22:18:44.8873926Z inflating: build/bin/reduce_ops_test 2024-08-20T22:18:44.8925843Z inflating: build/bin/reportMemoryUsage_test 2024-08-20T22:18:44.8983451Z inflating: build/bin/scalar_tensor_test 2024-08-20T22:18:44.9042907Z inflating: build/bin/scalar_test 2024-08-20T22:18:44.9095594Z inflating: build/bin/StorageUtils_test 2024-08-20T22:18:44.9149078Z inflating: build/bin/stride_properties_test 2024-08-20T22:18:44.9229131Z inflating: build/bin/tensor_iterator_test 2024-08-20T22:18:44.9284187Z inflating: build/bin/test_parallel 2024-08-20T22:18:44.9287771Z inflating: build/bin/thread_init_test 2024-08-20T22:18:44.9344414Z inflating: build/bin/type_ptr_test 2024-08-20T22:18:44.9406026Z inflating: build/bin/type_test 2024-08-20T22:18:44.9459075Z inflating: build/bin/undefined_tensor_test 2024-08-20T22:18:44.9461021Z inflating: build/bin/verify_api_visibility 2024-08-20T22:18:44.9531488Z inflating: build/bin/legacy_vmap_test 2024-08-20T22:18:44.9583989Z inflating: build/bin/weakref_test 2024-08-20T22:18:44.9636090Z inflating: build/bin/wrapdim_test 2024-08-20T22:18:44.9689186Z inflating: build/bin/xla_tensor_test 2024-08-20T22:18:44.9750402Z inflating: build/bin/IListRef_test 2024-08-20T22:18:44.9857787Z inflating: build/bin/List_test 2024-08-20T22:18:44.9925317Z inflating: build/bin/KernelFunction_test 2024-08-20T22:18:45.0048548Z inflating: build/bin/kernel_function_legacy_test 2024-08-20T22:18:45.0146656Z inflating: build/bin/kernel_function_test 2024-08-20T22:18:45.0276520Z inflating: build/bin/kernel_lambda_legacy_test 2024-08-20T22:18:45.0381819Z inflating: build/bin/kernel_lambda_test 2024-08-20T22:18:45.0443786Z inflating: build/bin/kernel_stackbased_test 2024-08-20T22:18:45.0541430Z inflating: build/bin/make_boxed_from_unboxed_functor_test 2024-08-20T22:18:45.0593405Z inflating: build/bin/CppSignature_test 2024-08-20T22:18:45.0649720Z inflating: build/bin/backend_fallback_test 2024-08-20T22:18:45.0699736Z inflating: build/bin/op_allowlist_test 2024-08-20T22:18:45.0764139Z inflating: build/bin/inline_container_test 2024-08-20T22:18:45.1069279Z inflating: build/bin/op_registration_test 2024-08-20T22:18:45.1122816Z inflating: build/bin/cuda_apply_test 2024-08-20T22:18:45.1175711Z inflating: build/bin/cuda_allocator_test 2024-08-20T22:18:45.1231085Z inflating: build/bin/cuda_caching_host_allocator_test 2024-08-20T22:18:45.1291278Z inflating: build/bin/cuda_atomic_ops_test 2024-08-20T22:18:45.1362105Z inflating: build/bin/cuda_complex_math_test 2024-08-20T22:18:45.1422724Z inflating: build/bin/cuda_complex_test 2024-08-20T22:18:45.1473452Z inflating: build/bin/cuda_device_test 2024-08-20T22:18:45.1532396Z inflating: build/bin/cuda_cub_test 2024-08-20T22:18:45.1584541Z inflating: build/bin/cuda_dlconvertor_test 2024-08-20T22:18:45.1650033Z inflating: build/bin/cuda_distributions_test 2024-08-20T22:18:45.1707903Z inflating: build/bin/cuda_generator_test 2024-08-20T22:18:45.1758320Z inflating: build/bin/cuda_half_test 2024-08-20T22:18:45.1810731Z inflating: build/bin/cuda_integer_divider_test 2024-08-20T22:18:45.1860909Z inflating: build/bin/cuda_optional_test 2024-08-20T22:18:45.1913774Z inflating: build/bin/cuda_packedtensoraccessor_test 2024-08-20T22:18:45.1967185Z inflating: build/bin/cuda_reportMemoryUsage_test 2024-08-20T22:18:45.2018192Z inflating: build/bin/cuda_allocatorTraceTracker_test 2024-08-20T22:18:45.2079944Z inflating: build/bin/cuda_stream_test 2024-08-20T22:18:45.2130206Z inflating: build/bin/cuda_cudnn_test 2024-08-20T22:18:45.2183426Z inflating: build/bin/cuda_vectorized_test 2024-08-20T22:18:45.2198430Z inflating: build/bin/tutorial_tensorexpr 2024-08-20T22:18:45.2264949Z inflating: build/bin/ProcessGroupGlooTest 2024-08-20T22:18:45.2323388Z inflating: build/bin/ProcessGroupGlooAsyncTest 2024-08-20T22:18:45.2388019Z inflating: build/bin/ProcessGroupNCCLTest 2024-08-20T22:18:45.2451049Z inflating: build/bin/ProcessGroupNCCLErrorsTest 2024-08-20T22:18:45.3282307Z inflating: build/bin/test_tensorexpr 2024-08-20T22:18:45.3866345Z inflating: build/bin/test_jit 2024-08-20T22:18:45.3867228Z creating: .additional_ci_files/ 2024-08-20T22:18:45.3928896Z inflating: .additional_ci_files/test-times.json 2024-08-20T22:18:45.4170762Z inflating: .additional_ci_files/test-class-times.json 2024-08-20T22:18:45.4206485Z ##[group]Run rm artifacts.zip 2024-08-20T22:18:45.4206873Z rm artifacts.zip 2024-08-20T22:18:45.4215754Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:45.4216258Z env: 2024-08-20T22:18:45.4216541Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:45.4216997Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:45.4217481Z ##[endgroup] 2024-08-20T22:18:45.5563865Z ##[group]Run df -H 2024-08-20T22:18:45.5564302Z df -H 2024-08-20T22:18:45.5573793Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:45.5574292Z env: 2024-08-20T22:18:45.5574562Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:45.5575003Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:45.5575486Z ##[endgroup] 2024-08-20T22:18:45.5624220Z Filesystem Size Used Avail Use% Mounted on 2024-08-20T22:18:45.5624753Z devtmpfs 4.2M 0 4.2M 0% /dev 2024-08-20T22:18:45.5625358Z tmpfs 34G 0 34G 0% /dev/shm 2024-08-20T22:18:45.5625948Z tmpfs 14G 553k 14G 1% /run 2024-08-20T22:18:45.5626399Z /dev/nvme0n1p1 161G 47G 115G 29% / 2024-08-20T22:18:45.5627060Z tmpfs 34G 4.1k 34G 1% /tmp 2024-08-20T22:18:45.5627543Z /dev/nvme0n1p128 11M 1.4M 9.2M 13% /boot/efi 2024-08-20T22:18:45.5628378Z tmpfs 6.7G 0 6.7G 0% /run/user/0 2024-08-20T22:18:45.5667156Z Prepare all required actions 2024-08-20T22:18:45.5667615Z Getting action download info 2024-08-20T22:18:45.6868001Z ##[group]Run ./.github/actions/download-td-artifacts 2024-08-20T22:18:45.6868491Z with: 2024-08-20T22:18:45.6868782Z env: 2024-08-20T22:18:45.6869053Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:45.6869490Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:45.6869981Z ##[endgroup] 2024-08-20T22:18:45.6902370Z ##[group]Run seemethere/download-artifact-s3@v4 2024-08-20T22:18:45.6902801Z with: 2024-08-20T22:18:45.6903062Z name: td_results 2024-08-20T22:18:45.6903369Z s3-bucket: gha-artifacts 2024-08-20T22:18:45.6903712Z region: us-east-1 2024-08-20T22:18:45.6904005Z env: 2024-08-20T22:18:45.6904267Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:45.6904705Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:45.6905178Z ##[endgroup] 2024-08-20T22:18:46.1586267Z (node:52470) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2024-08-20T22:18:46.1587054Z 2024-08-20T22:18:46.1587335Z Please migrate your code to use AWS SDK for JavaScript (v3). 2024-08-20T22:18:46.1588117Z For more information, check the migration guide at https://a.co/7PzMCcy 2024-08-20T22:18:46.1589026Z (Use `node --trace-warnings ...` to show where the warning was created) 2024-08-20T22:18:46.2271601Z Found 1 objects with prefix pytorch/pytorch/10479310961/td_results/ 2024-08-20T22:18:46.2272684Z Starting download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/td_results.json 2024-08-20T22:18:46.3015044Z Finished download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/td_results.json 2024-08-20T22:18:46.3021026Z Artifact download has finished successfully 2024-08-20T22:18:46.3345182Z ##[group]Run mkdir -p .additional_ci_files 2024-08-20T22:18:46.3345662Z mkdir -p .additional_ci_files 2024-08-20T22:18:46.3346225Z mv td_results.json .additional_ci_files/td_results.json 2024-08-20T22:18:46.3355861Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:46.3356360Z env: 2024-08-20T22:18:46.3356637Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:46.3357092Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:46.3357579Z ##[endgroup] 2024-08-20T22:18:46.3478559Z ##[group]Run .github/scripts/parse_ref.py 2024-08-20T22:18:46.3479214Z .github/scripts/parse_ref.py 2024-08-20T22:18:46.3487522Z shell: /usr/bin/bash -e {0} 2024-08-20T22:18:46.3487870Z env: 2024-08-20T22:18:46.3488161Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:46.3488664Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:46.3489138Z ##[endgroup] 2024-08-20T22:18:46.3773244Z Prepare all required actions 2024-08-20T22:18:46.3811361Z ##[group]Run ./.github/actions/get-workflow-job-id 2024-08-20T22:18:46.3811810Z with: 2024-08-20T22:18:46.3812375Z github-token: *** 2024-08-20T22:18:46.3812692Z env: 2024-08-20T22:18:46.3812971Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:46.3813435Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:46.3813919Z ##[endgroup] 2024-08-20T22:18:46.3830426Z ##[group]Run set -eux 2024-08-20T22:18:46.3830764Z set -eux 2024-08-20T22:18:46.3831346Z python3 .github/scripts/get_workflow_job_id.py "${GITHUB_RUN_ID}" "${RUNNER_NAME}" 2024-08-20T22:18:46.3840250Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:46.3840747Z env: 2024-08-20T22:18:46.3841033Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:46.3841481Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:46.3842110Z GITHUB_TOKEN: *** 2024-08-20T22:18:46.3842420Z ##[endgroup] 2024-08-20T22:18:46.3875150Z + python3 .github/scripts/get_workflow_job_id.py 10479310961 i-0b43e2cc0d7540218 2024-08-20T22:18:47.5433320Z setting job-id=29026448828 2024-08-20T22:18:47.5434222Z setting job-name=linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 1, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) 2024-08-20T22:18:47.5651497Z Prepare all required actions 2024-08-20T22:18:47.5651950Z Getting action download info 2024-08-20T22:18:47.6732257Z ##[group]Run ./.github/actions/filter-test-configs 2024-08-20T22:18:47.6732709Z with: 2024-08-20T22:18:47.6733165Z github-token: *** 2024-08-20T22:18:47.6735388Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 2, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 3, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 4, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 5, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}]} 2024-08-20T22:18:47.6738056Z job-name: linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 1, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) 2024-08-20T22:18:47.6738815Z env: 2024-08-20T22:18:47.6739130Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:47.6739620Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:47.6740115Z ##[endgroup] 2024-08-20T22:18:47.6782589Z ##[group]Run nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482 2024-08-20T22:18:47.6783143Z with: 2024-08-20T22:18:47.6783419Z shell: bash 2024-08-20T22:18:47.6783730Z timeout_minutes: 10 2024-08-20T22:18:47.6784061Z max_attempts: 5 2024-08-20T22:18:47.6784380Z retry_wait_seconds: 30 2024-08-20T22:18:47.6785460Z command: set -eux # PyYAML 6.0 doesn't work with MacOS x86 anymore # This must run on Python-3.7 (AmazonLinux2) so can't use request=3.32.2 python3 -m pip install requests==2.27.1 pyyaml==6.0.1 2024-08-20T22:18:47.6786603Z polling_interval_seconds: 1 2024-08-20T22:18:47.6786984Z warning_on_retry: true 2024-08-20T22:18:47.6787346Z continue_on_error: false 2024-08-20T22:18:47.6787681Z env: 2024-08-20T22:18:47.6787961Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:47.6788413Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:47.6789046Z GITHUB_TOKEN: *** 2024-08-20T22:18:47.6789365Z ##[endgroup] 2024-08-20T22:18:47.7530640Z + python3 -m pip install requests==2.27.1 pyyaml==6.0.1 2024-08-20T22:18:47.9927747Z Defaulting to user installation because normal site-packages is not writeable 2024-08-20T22:18:48.1462655Z Collecting requests==2.27.1 2024-08-20T22:18:48.1844251Z Downloading requests-2.27.1-py2.py3-none-any.whl (63 kB) 2024-08-20T22:18:48.3830815Z Collecting pyyaml==6.0.1 2024-08-20T22:18:48.3907893Z Downloading PyYAML-6.0.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (738 kB) 2024-08-20T22:18:48.7902067Z Collecting charset-normalizer~=2.0.0 2024-08-20T22:18:48.7954113Z Downloading charset_normalizer-2.0.12-py3-none-any.whl (39 kB) 2024-08-20T22:18:48.8221333Z Requirement already satisfied: idna<4,>=2.5 in /usr/lib/python3.9/site-packages (from requests==2.27.1) (2.10) 2024-08-20T22:18:48.8225393Z Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/lib/python3.9/site-packages (from requests==2.27.1) (1.25.10) 2024-08-20T22:18:48.8907591Z Collecting certifi>=2017.4.17 2024-08-20T22:18:48.8956362Z Downloading certifi-2024.7.4-py3-none-any.whl (162 kB) 2024-08-20T22:18:49.0060999Z Installing collected packages: charset-normalizer, certifi, requests, pyyaml 2024-08-20T22:18:49.2890768Z Successfully installed certifi-2024.7.4 charset-normalizer-2.0.12 pyyaml-6.0.1 requests-2.27.1 2024-08-20T22:18:49.7320038Z Command completed after 1 attempt(s). 2024-08-20T22:18:49.7378061Z ##[group]Run set -x 2024-08-20T22:18:49.7378392Z set -x 2024-08-20T22:18:49.7378686Z  2024-08-20T22:18:49.7379224Z # Use relative path here as this could be checked out anywhere, not necessarily 2024-08-20T22:18:49.7379888Z # in runner workspace 2024-08-20T22:18:49.7380412Z python3 "${GITHUB_ACTION_PATH}/../../scripts/parse_ref.py" 2024-08-20T22:18:49.7389615Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:49.7390120Z env: 2024-08-20T22:18:49.7390406Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:49.7390848Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:49.7391328Z ##[endgroup] 2024-08-20T22:18:49.7421025Z + python3 /home/ec2-user/actions-runner/_work/pytorch/pytorch/./.github/actions/filter-test-configs/../../scripts/parse_ref.py 2024-08-20T22:18:49.7684967Z ##[group]Run echo "Workflow: ${GITHUB_WORKFLOW}" 2024-08-20T22:18:49.7685496Z echo "Workflow: ${GITHUB_WORKFLOW}" 2024-08-20T22:18:49.7685949Z echo "Job name: ${JOB_NAME}" 2024-08-20T22:18:49.7686345Z  2024-08-20T22:18:49.7686883Z # Use relative path here as this could be checked out anywhere, not necessarily 2024-08-20T22:18:49.7687548Z # in runner workspace 2024-08-20T22:18:49.7688118Z python3 "${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2024-08-20T22:18:49.7688748Z  --workflow "${GITHUB_WORKFLOW}" \ 2024-08-20T22:18:49.7689216Z  --job-name "${JOB_NAME}" \ 2024-08-20T22:18:49.7691614Z  --test-matrix "{"include": [{"config": "default", "shard": 1, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 2, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 3, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 4, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 5, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}]}" \ 2024-08-20T22:18:49.7693989Z  --selected-test-configs "" \ 2024-08-20T22:18:49.7694438Z  --pr-number "${PR_NUMBER}" \ 2024-08-20T22:18:49.7694853Z  --tag "${TAG}" \ 2024-08-20T22:18:49.7695239Z  --event-name "${EVENT_NAME}" \ 2024-08-20T22:18:49.7695682Z  --schedule "${SCHEDULE}" \ 2024-08-20T22:18:49.7696112Z  --branch "${HEAD_BRANCH}" 2024-08-20T22:18:49.7705384Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:49.7705873Z env: 2024-08-20T22:18:49.7706157Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:49.7706598Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:49.7707266Z GITHUB_TOKEN: *** 2024-08-20T22:18:49.7707971Z JOB_NAME: linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 1, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) 2024-08-20T22:18:49.7708933Z PR_NUMBER: 2024-08-20T22:18:49.7709237Z TAG: ciflow/trunk/133712 2024-08-20T22:18:49.7709588Z EVENT_NAME: push 2024-08-20T22:18:49.7709897Z SCHEDULE: 2024-08-20T22:18:49.7710182Z HEAD_BRANCH: 2024-08-20T22:18:49.7710478Z ##[endgroup] 2024-08-20T22:18:49.7739599Z Workflow: trunk 2024-08-20T22:18:49.7740588Z Job name: linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 1, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) 2024-08-20T22:18:50.0288461Z INFO:root:Found no test-config label on the PR, so all test configs are included 2024-08-20T22:18:50.1808904Z ##[group]Run echo "Filtered matrix:" 2024-08-20T22:18:50.1809353Z echo "Filtered matrix:" 2024-08-20T22:18:50.1811643Z echo "{"include": [{"config": "default", "shard": 1, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 2, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 3, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 4, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}, {"config": "default", "shard": 5, "num_shards": 5, "runner": "amz2023.linux.g5.4xlarge.nvidia.gpu"}]}" 2024-08-20T22:18:50.1813924Z  2024-08-20T22:18:50.1814199Z echo 2024-08-20T22:18:50.1814576Z echo "Is the current job unstable? False" 2024-08-20T22:18:50.1815031Z  2024-08-20T22:18:50.1815304Z echo 2024-08-20T22:18:50.1815877Z echo "Is keep-going label set? False" 2024-08-20T22:18:50.1816320Z  2024-08-20T22:18:50.1816590Z echo 2024-08-20T22:18:50.1816913Z echo "Renabled issues? " 2024-08-20T22:18:50.1825800Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:50.1826291Z env: 2024-08-20T22:18:50.1826577Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:50.1827033Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:50.1827506Z ##[endgroup] 2024-08-20T22:18:50.1860021Z Filtered matrix: 2024-08-20T22:18:50.1863275Z {include: [{config: default, shard: 1, num_shards: 5, runner: amz2023.linux.g5.4xlarge.nvidia.gpu}, {config: default, shard: 2, num_shards: 5, runner: amz2023.linux.g5.4xlarge.nvidia.gpu}, {config: default, shard: 3, num_shards: 5, runner: amz2023.linux.g5.4xlarge.nvidia.gpu}, {config: default, shard: 4, num_shards: 5, runner: amz2023.linux.g5.4xlarge.nvidia.gpu}, {config: default, shard: 5, num_shards: 5, runner: amz2023.linux.g5.4xlarge.nvidia.gpu}]} 2024-08-20T22:18:50.1865540Z 2024-08-20T22:18:50.1865697Z Is the current job unstable? False 2024-08-20T22:18:50.1865988Z 2024-08-20T22:18:50.1866358Z Is keep-going label set? False 2024-08-20T22:18:50.1866615Z 2024-08-20T22:18:50.1866740Z Renabled issues? 2024-08-20T22:18:50.1919260Z ##[group]Run echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2024-08-20T22:18:50.1920090Z echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2024-08-20T22:18:50.1929004Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T22:18:50.1929499Z env: 2024-08-20T22:18:50.1929786Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:50.1930241Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:50.1930726Z JOB_TIMEOUT: 240 2024-08-20T22:18:50.1931037Z ##[endgroup] 2024-08-20T22:18:50.2055370Z ##[group]Run set -x 2024-08-20T22:18:50.2055761Z set -x 2024-08-20T22:18:50.2056053Z  2024-08-20T22:18:50.2056399Z if [[ $TEST_CONFIG == 'multigpu' ]]; then 2024-08-20T22:18:50.2056923Z  TEST_COMMAND=.ci/pytorch/multigpu-test.sh 2024-08-20T22:18:50.2057473Z elif [[ $BUILD_ENVIRONMENT == *onnx* ]]; then 2024-08-20T22:18:50.2057974Z  TEST_COMMAND=.ci/onnx/test.sh 2024-08-20T22:18:50.2058371Z else 2024-08-20T22:18:50.2058808Z  TEST_COMMAND=.ci/pytorch/test.sh 2024-08-20T22:18:50.2059458Z fi 2024-08-20T22:18:50.2059748Z  2024-08-20T22:18:50.2060215Z # detached container should get cleaned up by teardown_ec2_linux 2024-08-20T22:18:50.2060961Z # TODO: Stop building test binaries as part of the build phase 2024-08-20T22:18:50.2061621Z # Used for GPU_FLAG since that doesn't play nice 2024-08-20T22:18:50.2062189Z # shellcheck disable=SC2086,SC2090 2024-08-20T22:18:50.2062637Z container_name=$(docker run \ 2024-08-20T22:18:50.2063050Z  ${GPU_FLAG:-} \ 2024-08-20T22:18:50.2063422Z  -e BUILD_ENVIRONMENT \ 2024-08-20T22:18:50.2063807Z  -e PR_NUMBER \ 2024-08-20T22:18:50.2064164Z  -e GITHUB_ACTIONS \ 2024-08-20T22:18:50.2064549Z  -e GITHUB_REPOSITORY \ 2024-08-20T22:18:50.2064938Z  -e GITHUB_WORKFLOW \ 2024-08-20T22:18:50.2065316Z  -e GITHUB_JOB \ 2024-08-20T22:18:50.2065675Z  -e GITHUB_RUN_ID \ 2024-08-20T22:18:50.2066061Z  -e GITHUB_RUN_NUMBER \ 2024-08-20T22:18:50.2066457Z  -e GITHUB_RUN_ATTEMPT \ 2024-08-20T22:18:50.2066923Z  -e JOB_ID \ 2024-08-20T22:18:50.2067261Z  -e JOB_NAME \ 2024-08-20T22:18:50.2067599Z  -e BASE_SHA \ 2024-08-20T22:18:50.2068213Z  -e BRANCH \ 2024-08-20T22:18:50.2068539Z  -e SHA1 \ 2024-08-20T22:18:50.2068873Z  -e AWS_DEFAULT_REGION \ 2024-08-20T22:18:50.2069289Z  -e IN_WHEEL_TEST \ 2024-08-20T22:18:50.2069691Z  -e SHARD_NUMBER \ 2024-08-20T22:18:50.2070050Z  -e TEST_CONFIG \ 2024-08-20T22:18:50.2070413Z  -e NUM_TEST_SHARDS \ 2024-08-20T22:18:50.2070798Z  -e REENABLED_ISSUES \ 2024-08-20T22:18:50.2071197Z  -e CONTINUE_THROUGH_ERROR \ 2024-08-20T22:18:50.2071613Z  -e VERBOSE_TEST_LOGS \ 2024-08-20T22:18:50.2072008Z  -e TEST_SHOWLOCALS \ 2024-08-20T22:18:50.2072387Z  -e NO_TEST_TIMEOUT \ 2024-08-20T22:18:50.2072758Z  -e NO_TD \ 2024-08-20T22:18:50.2073093Z  -e TD_DISTRIBUTED \ 2024-08-20T22:18:50.2073460Z  -e PR_LABELS \ 2024-08-20T22:18:50.2073856Z  -e MAX_JOBS="$(nproc --ignore=2)" \ 2024-08-20T22:18:50.2074305Z  -e SCCACHE_BUCKET \ 2024-08-20T22:18:50.2074686Z  -e SCCACHE_S3_KEY_PREFIX \ 2024-08-20T22:18:50.2075083Z  -e XLA_CUDA \ 2024-08-20T22:18:50.2075483Z  -e XLA_CLANG_CACHE_S3_BUCKET_NAME \ 2024-08-20T22:18:50.2075973Z  -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK \ 2024-08-20T22:18:50.2076495Z  -e PYTORCH_TEST_RERUN_DISABLED_TESTS \ 2024-08-20T22:18:50.2076998Z  -e SKIP_SCCACHE_INITIALIZATION=1 \ 2024-08-20T22:18:50.2077451Z  -e HUGGING_FACE_HUB_TOKEN \ 2024-08-20T22:18:50.2077901Z  -e SCRIBE_GRAPHQL_ACCESS_TOKEN \ 2024-08-20T22:18:50.2078335Z  -e DASHBOARD_TAG \ 2024-08-20T22:18:50.2078800Z  --env-file="/tmp/github_env_${GITHUB_RUN_ID}" \ 2024-08-20T22:18:50.2079389Z  --security-opt seccomp=unconfined \ 2024-08-20T22:18:50.2079915Z  --cap-add=SYS_PTRACE \ 2024-08-20T22:18:50.2080297Z  --ipc=host \ 2024-08-20T22:18:50.2080638Z  --shm-size="${SHM_SIZE}" \ 2024-08-20T22:18:50.2081025Z  --tty \ 2024-08-20T22:18:50.2081330Z  --detach \ 2024-08-20T22:18:50.2081842Z  --name="${container_name}" \ 2024-08-20T22:18:50.2082263Z  --user jenkins \ 2024-08-20T22:18:50.2082740Z  -v "${GITHUB_WORKSPACE}:/var/lib/jenkins/workspace" \ 2024-08-20T22:18:50.2083284Z  -w /var/lib/jenkins/workspace \ 2024-08-20T22:18:50.2083715Z  "${DOCKER_IMAGE}" 2024-08-20T22:18:50.2084061Z ) 2024-08-20T22:18:50.2084457Z # Propagate download.pytorch.org IP to container 2024-08-20T22:18:50.2085352Z grep download.pytorch.org /etc/hosts | docker exec -i "${container_name}" sudo bash -c "/bin/cat >> /etc/hosts" 2024-08-20T22:18:50.2086412Z echo "DOCKER_CONTAINER_ID=${container_name}" >> "${GITHUB_ENV}" 2024-08-20T22:18:50.2087283Z docker exec -t "${container_name}" sh -c "pip install $(echo dist/*.whl)[opt-einsum] && ${TEST_COMMAND}" 2024-08-20T22:18:50.2095830Z shell: /usr/bin/bash -e {0} 2024-08-20T22:18:50.2096184Z env: 2024-08-20T22:18:50.2096459Z GIT_DEFAULT_BRANCH: main 2024-08-20T22:18:50.2096917Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:18:50.2097515Z BUILD_ENVIRONMENT: linux-focal-cuda12.4-py3.10-gcc9-sm86 2024-08-20T22:18:50.2098011Z PR_NUMBER: 2024-08-20T22:18:50.2098331Z GITHUB_REPOSITORY: pytorch/pytorch 2024-08-20T22:18:50.2098741Z GITHUB_WORKFLOW: trunk 2024-08-20T22:18:50.2099082Z GITHUB_JOB: test 2024-08-20T22:18:50.2099393Z GITHUB_RUN_ID: 10479310961 2024-08-20T22:18:50.2099760Z GITHUB_RUN_NUMBER: 92245 2024-08-20T22:18:50.2100112Z GITHUB_RUN_ATTEMPT: 1 2024-08-20T22:18:50.2100442Z JOB_ID: 29026448828 2024-08-20T22:18:50.2101146Z JOB_NAME: linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 1, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) 2024-08-20T22:18:50.2101907Z BRANCH: 2024-08-20T22:18:50.2102244Z SHA1: 40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:18:50.2102762Z BASE_SHA: 40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:18:50.2103226Z TEST_CONFIG: default 2024-08-20T22:18:50.2103550Z SHARD_NUMBER: 1 2024-08-20T22:18:50.2103865Z NUM_TEST_SHARDS: 5 2024-08-20T22:18:50.2104191Z REENABLED_ISSUES: 2024-08-20T22:18:50.2104527Z CONTINUE_THROUGH_ERROR: False 2024-08-20T22:18:50.2104915Z VERBOSE_TEST_LOGS: False 2024-08-20T22:18:50.2105272Z TEST_SHOWLOCALS: False 2024-08-20T22:18:50.2105616Z NO_TEST_TIMEOUT: False 2024-08-20T22:18:50.2105949Z NO_TD: False 2024-08-20T22:18:50.2106252Z TD_DISTRIBUTED: False 2024-08-20T22:18:50.2106659Z SCCACHE_BUCKET: ossci-compiler-cache-circleci-v2 2024-08-20T22:18:50.2107140Z SCCACHE_S3_KEY_PREFIX: trunk 2024-08-20T22:18:50.2107501Z SHM_SIZE: 2g 2024-08-20T22:18:50.2108422Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:18:50.2109431Z XLA_CUDA: 2024-08-20T22:18:50.2109907Z XLA_CLANG_CACHE_S3_BUCKET_NAME: ossci-compiler-clang-cache-circleci-xla 2024-08-20T22:18:50.2110552Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK: 0 2024-08-20T22:18:50.2110991Z PYTORCH_TEST_RERUN_DISABLED_TESTS: 0 2024-08-20T22:18:50.2111401Z DASHBOARD_TAG: 2024-08-20T22:18:50.2111715Z HUGGING_FACE_HUB_TOKEN: 2024-08-20T22:18:50.2112080Z SCRIBE_GRAPHQL_ACCESS_TOKEN: 2024-08-20T22:18:50.2112451Z ##[endgroup] 2024-08-20T22:18:50.2139878Z + [[ default == \m\u\l\t\i\g\p\u ]] 2024-08-20T22:18:50.2140631Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *onnx* ]] 2024-08-20T22:18:50.2141138Z + TEST_COMMAND=.ci/pytorch/test.sh 2024-08-20T22:18:50.2148732Z +++ nproc --ignore=2 2024-08-20T22:18:50.2353047Z ++ docker run --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all -e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY -e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER -e GITHUB_RUN_ATTEMPT -e JOB_ID -e JOB_NAME -e BASE_SHA -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION -e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS -e REENABLED_ISSUES -e CONTINUE_THROUGH_ERROR -e VERBOSE_TEST_LOGS -e TEST_SHOWLOCALS -e NO_TEST_TIMEOUT -e NO_TD -e TD_DISTRIBUTED -e PR_LABELS -e MAX_JOBS=14 -e SCCACHE_BUCKET -e SCCACHE_S3_KEY_PREFIX -e XLA_CUDA -e XLA_CLANG_CACHE_S3_BUCKET_NAME -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK -e PYTORCH_TEST_RERUN_DISABLED_TESTS -e SKIP_SCCACHE_INITIALIZATION=1 -e HUGGING_FACE_HUB_TOKEN -e SCRIBE_GRAPHQL_ACCESS_TOKEN -e DASHBOARD_TAG --env-file=/tmp/github_env_10479310961 --security-opt seccomp=unconfined --cap-add=SYS_PTRACE --ipc=host --shm-size=2g --tty --detach --name= --user jenkins -v /home/ec2-user/actions-runner/_work/pytorch/pytorch:/var/lib/jenkins/workspace -w /var/lib/jenkins/workspace 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T22:18:59.0366029Z + container_name=00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T22:18:59.0369070Z + grep download.pytorch.org /etc/hosts 2024-08-20T22:18:59.0370992Z + docker exec -i 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe sudo bash -c '/bin/cat >> /etc/hosts' 2024-08-20T22:18:59.1731546Z + echo DOCKER_CONTAINER_ID=00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T22:18:59.1736361Z ++ echo dist/torch-2.5.0a0+git40ec5f6-cp310-cp310-linux_x86_64.whl 2024-08-20T22:18:59.1739591Z + docker exec -t 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe sh -c 'pip install dist/torch-2.5.0a0+git40ec5f6-cp310-cp310-linux_x86_64.whl[opt-einsum] && .ci/pytorch/test.sh' 2024-08-20T22:18:59.5645627Z Processing ./dist/torch-2.5.0a0+git40ec5f6-cp310-cp310-linux_x86_64.whl (from torch==2.5.0a0+git40ec5f6) 2024-08-20T22:18:59.8870798Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (3.13.1) 2024-08-20T22:18:59.8873799Z Requirement already satisfied: typing-extensions>=4.8.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (4.12.2) 2024-08-20T22:18:59.8876355Z Requirement already satisfied: networkx in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (2.8.8) 2024-08-20T22:18:59.8879459Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (3.1.4) 2024-08-20T22:18:59.8882456Z Requirement already satisfied: fsspec in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (2024.6.1) 2024-08-20T22:18:59.8887173Z Requirement already satisfied: sympy==1.13.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (1.13.1) 2024-08-20T22:18:59.8903175Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy==1.13.1->torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (1.3.0) 2024-08-20T22:18:59.8915996Z Requirement already satisfied: opt-einsum>=3.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (3.3.0) 2024-08-20T22:18:59.8933261Z Requirement already satisfied: numpy>=1.7 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from opt-einsum>=3.3->torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (1.22.4) 2024-08-20T22:18:59.9323033Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch==2.5.0a0+git40ec5f6->torch==2.5.0a0+git40ec5f6) (2.1.5) 2024-08-20T22:19:00.2521029Z Installing collected packages: torch 2024-08-20T22:19:10.1344861Z Successfully installed torch-2.5.0a0+git40ec5f6 2024-08-20T22:19:10.2089055Z + export TERM=vt100 2024-08-20T22:19:10.2089388Z + TERM=vt100 2024-08-20T22:19:10.2092424Z ++ dirname .ci/pytorch/test.sh 2024-08-20T22:19:10.2103108Z + source .ci/pytorch/common.sh 2024-08-20T22:19:10.2107211Z +++ dirname .ci/pytorch/common.sh 2024-08-20T22:19:10.2118169Z ++ source .ci/pytorch/common_utils.sh 2024-08-20T22:19:10.2119128Z +++ declare -f -t trap_add 2024-08-20T22:19:10.2124889Z ++ set -ex 2024-08-20T22:19:10.2125476Z ++ [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *rocm* ]] 2024-08-20T22:19:10.2126105Z ++ BUILD_TEST_LIBTORCH=0 2024-08-20T22:19:10.2126749Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *rocm* ]] 2024-08-20T22:19:10.2129440Z ++ stat -c %u /var/lib/jenkins/workspace 2024-08-20T22:19:10.2146965Z + WORKSPACE_ORIGINAL_OWNER_ID=1000 2024-08-20T22:19:10.2147501Z + trap_add cleanup_workspace EXIT 2024-08-20T22:19:10.2148024Z + trap_add_cmd=cleanup_workspace 2024-08-20T22:19:10.2148654Z + shift 2024-08-20T22:19:10.2148931Z + for trap_add_name in "$@" 2024-08-20T22:19:10.2155254Z +++ trap -p EXIT 2024-08-20T22:19:10.2158309Z ++ eval 'extract_trap_cmd ' 2024-08-20T22:19:10.2158797Z +++ extract_trap_cmd 2024-08-20T22:19:10.2159270Z +++ printf '%s\n' '' 2024-08-20T22:19:10.2159670Z ++ printf '%s\n' cleanup_workspace 2024-08-20T22:19:10.2162136Z + trap -- ' 2024-08-20T22:19:10.2162603Z cleanup_workspace' EXIT 2024-08-20T22:19:10.2163185Z + sudo chown -R jenkins /var/lib/jenkins/workspace 2024-08-20T22:19:10.8894067Z + git config --global --add safe.directory /var/lib/jenkins/workspace 2024-08-20T22:19:10.8916729Z + echo 'Environment variables:' 2024-08-20T22:19:10.8917287Z Environment variables: 2024-08-20T22:19:10.8917649Z + env 2024-08-20T22:19:10.8928309Z INSTALLED_DB=yes 2024-08-20T22:19:10.8928822Z NV_LIBCUBLAS_VERSION=12.4.2.65-1 2024-08-20T22:19:10.8929298Z NVIDIA_VISIBLE_DEVICES=all 2024-08-20T22:19:10.8933353Z NV_NVML_DEV_VERSION=12.4.99-1 2024-08-20T22:19:10.8934231Z GITHUB_WORKSPACE=/home/ec2-user/actions-runner/_work/pytorch/pytorch 2024-08-20T22:19:10.8934968Z CONTINUE_THROUGH_ERROR=False 2024-08-20T22:19:10.8935510Z NV_LIBNCCL_DEV_PACKAGE=libnccl-dev=2.20.5-1+cuda12.4 2024-08-20T22:19:10.8936074Z NV_LIBNCCL_DEV_PACKAGE_VERSION=2.20.5-1 2024-08-20T22:19:10.8936739Z BUILD_ENVIRONMENT=linux-focal-cuda12.4-py3.10-gcc9-sm86 2024-08-20T22:19:10.8937244Z HOSTNAME=00fa8332bfd4 2024-08-20T22:19:10.8938264Z GITHUB_PATH=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/add_path_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.8939249Z GITHUB_ACTION=__self 2024-08-20T22:19:10.8939656Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2024-08-20T22:19:10.8945712Z NVIDIA_REQUIRE_CUDA=cuda>=12.4 brand=tesla,driver>=470,driver<471 brand=unknown,driver>=470,driver<471 brand=nvidia,driver>=470,driver<471 brand=nvidiartx,driver>=470,driver<471 brand=geforce,driver>=470,driver<471 brand=geforcertx,driver>=470,driver<471 brand=quadro,driver>=470,driver<471 brand=quadrortx,driver>=470,driver<471 brand=titan,driver>=470,driver<471 brand=titanrtx,driver>=470,driver<471 brand=tesla,driver>=525,driver<526 brand=unknown,driver>=525,driver<526 brand=nvidia,driver>=525,driver<526 brand=nvidiartx,driver>=525,driver<526 brand=geforce,driver>=525,driver<526 brand=geforcertx,driver>=525,driver<526 brand=quadro,driver>=525,driver<526 brand=quadrortx,driver>=525,driver<526 brand=titan,driver>=525,driver<526 brand=titanrtx,driver>=525,driver<526 brand=tesla,driver>=535,driver<536 brand=unknown,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=geforce,driver>=535,driver<536 brand=geforcertx,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=titan,driver>=535,driver<536 brand=titanrtx,driver>=535,driver<536 2024-08-20T22:19:10.8951368Z NV_LIBCUBLAS_DEV_PACKAGE=libcublas-dev-12-4=12.4.2.65-1 2024-08-20T22:19:10.8951868Z NV_NVTX_VERSION=12.4.99-1 2024-08-20T22:19:10.8952202Z GITHUB_RUN_NUMBER=92245 2024-08-20T22:19:10.8952531Z TEST_CONFIG=default 2024-08-20T22:19:10.8952857Z GITHUB_REPOSITORY_OWNER_ID=21003710 2024-08-20T22:19:10.8953364Z TORCH_NVCC_FLAGS=-Xfatbin -compress-all 2024-08-20T22:19:10.8953805Z NV_CUDA_CUDART_DEV_VERSION=12.4.99-1 2024-08-20T22:19:10.8954283Z NV_LIBCUSPARSE_VERSION=12.3.0.142-1 2024-08-20T22:19:10.8954687Z SCRIBE_GRAPHQL_ACCESS_TOKEN= 2024-08-20T22:19:10.8955301Z NV_LIBNPP_VERSION=12.2.5.2-1 2024-08-20T22:19:10.8955756Z GITHUB_TRIGGERING_ACTOR=pytorch-bot[bot] 2024-08-20T22:19:10.8956242Z CMAKE_CUDA_COMPILER_LAUNCHER=/opt/cache/bin/sccache 2024-08-20T22:19:10.8956692Z GITHUB_REF_TYPE=tag 2024-08-20T22:19:10.8957019Z TORCH_CUDA_ARCH_LIST=Maxwell 2024-08-20T22:19:10.8957398Z NCCL_VERSION=2.20.5-1 2024-08-20T22:19:10.8957781Z BASE_SHA=40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:19:10.8958223Z XLA_CUDA= 2024-08-20T22:19:10.8958518Z HUGGING_FACE_HUB_TOKEN= 2024-08-20T22:19:10.8960672Z *** 2024-08-20T22:19:10.8960974Z CARGO_NET_GIT_FETCH_WITH_CLI=true 2024-08-20T22:19:10.8961558Z GITHUB_REPOSITORY_ID=65600975 2024-08-20T22:19:10.8961915Z GITHUB_ACTIONS=true 2024-08-20T22:19:10.8962247Z NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:19:10.8962760Z NV_NVPROF_DEV_PACKAGE=cuda-nvprof-12-4=12.4.99-1 2024-08-20T22:19:10.8963293Z NV_LIBNPP_PACKAGE=libnpp-12-4=12.2.5.2-1 2024-08-20T22:19:10.8963766Z SHA1=40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:19:10.8964294Z NV_LIBNCCL_DEV_PACKAGE_NAME=libnccl-dev 2024-08-20T22:19:10.8964770Z GITHUB_SHA=40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:19:10.8965513Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk.yml@refs/tags/ciflow/trunk/133712 2024-08-20T22:19:10.8966178Z UCC_HOME=/usr 2024-08-20T22:19:10.8966515Z NV_LIBCUBLAS_DEV_VERSION=12.4.2.65-1 2024-08-20T22:19:10.8966921Z VERBOSE_TEST_LOGS=False 2024-08-20T22:19:10.8967270Z NVIDIA_PRODUCT_NAME=CUDA 2024-08-20T22:19:10.8968077Z NV_LIBCUBLAS_DEV_PACKAGE_NAME=libcublas-dev-12-4 2024-08-20T22:19:10.8968568Z GITHUB_REF=refs/tags/ciflow/trunk/133712 2024-08-20T22:19:10.8969037Z NV_CUDA_CUDART_VERSION=12.4.99-1 2024-08-20T22:19:10.8969400Z SHARD_NUMBER=1 2024-08-20T22:19:10.8969701Z GITHUB_REF_PROTECTED=false 2024-08-20T22:19:10.8970041Z HOME=/var/lib/jenkins 2024-08-20T22:19:10.8970388Z GITHUB_API_URL=https://api.github.com 2024-08-20T22:19:10.8970811Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2024-08-20T22:19:10.8971292Z UCX_COMMIT=7bb2722ff2187a0cad557ae4a6afa090569f83fb 2024-08-20T22:19:10.8971750Z SCCACHE_S3_KEY_PREFIX=trunk 2024-08-20T22:19:10.8972096Z CUDA_VERSION=12.4.0 2024-08-20T22:19:10.8972513Z NV_LIBCUBLAS_PACKAGE=libcublas-12-4=12.4.2.65-1 2024-08-20T22:19:10.8972932Z NUM_TEST_SHARDS=5 2024-08-20T22:19:10.8973229Z UCX_HOME=/usr 2024-08-20T22:19:10.8973734Z NV_CUDA_NSIGHT_COMPUTE_DEV_PACKAGE=cuda-nsight-compute-12-4=12.4.0-1 2024-08-20T22:19:10.8974782Z GITHUB_STATE=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/save_state_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.8976040Z JOB_NAME=linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 1, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) 2024-08-20T22:19:10.8977283Z GITHUB_ENV=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_env_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.8978363Z GITHUB_EVENT_PATH=/home/ec2-user/actions-runner/_work/_temp/_github_workflow/event.json 2024-08-20T22:19:10.8978990Z GITHUB_EVENT_NAME=push 2024-08-20T22:19:10.8979316Z DASHBOARD_TAG= 2024-08-20T22:19:10.8979615Z GITHUB_RUN_ID=10479310961 2024-08-20T22:19:10.8980061Z NV_LIBNPP_DEV_PACKAGE=libnpp-dev-12-4=12.2.5.2-1 2024-08-20T22:19:10.8980586Z NV_LIBCUBLAS_PACKAGE_NAME=libcublas-12-4 2024-08-20T22:19:10.8981553Z GITHUB_STEP_SUMMARY=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/step_summary_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.8982429Z GITHUB_ACTOR=pytorch-bot[bot] 2024-08-20T22:19:10.8982834Z NV_LIBNPP_DEV_VERSION=12.2.5.2-1 2024-08-20T22:19:10.8983193Z PR_NUMBER= 2024-08-20T22:19:10.8983466Z GITHUB_RUN_ATTEMPT=1 2024-08-20T22:19:10.8983800Z ANACONDA_PYTHON_VERSION=3.10 2024-08-20T22:19:10.8984235Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2024-08-20T22:19:10.8984673Z TERM=vt100 2024-08-20T22:19:10.8985002Z NV_LIBCUSPARSE_DEV_VERSION=12.3.0.142-1 2024-08-20T22:19:10.8985403Z INSTALLED_VISION=yes 2024-08-20T22:19:10.8985699Z BRANCH= 2024-08-20T22:19:10.8985975Z OPENSSL_ROOT_DIR=/opt/openssl 2024-08-20T22:19:10.8986531Z LIBRARY_PATH=/usr/local/cuda/lib64/stubs 2024-08-20T22:19:10.8986952Z CUDA_PATH=/usr/local/cuda 2024-08-20T22:19:10.8987697Z GITHUB_ACTION_PATH=/home/ec2-user/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-linux 2024-08-20T22:19:10.8988428Z GITHUB_SERVER_URL=https://github.com 2024-08-20T22:19:10.8988902Z UCC_COMMIT=20eae37090a4ce1b32bcce6144ccad0b49943e0b 2024-08-20T22:19:10.8989364Z REENABLED_ISSUES= 2024-08-20T22:19:10.8989661Z SHLVL=1 2024-08-20T22:19:10.8989917Z MAX_JOBS=14 2024-08-20T22:19:10.8990234Z NV_CUDA_LIB_VERSION=12.4.0-1 2024-08-20T22:19:10.8990583Z NVARCH=x86_64 2024-08-20T22:19:10.8991007Z GITHUB_ACTOR_ID=54816060 2024-08-20T22:19:10.8991458Z GITHUB_WORKFLOW_SHA=40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:19:10.8991977Z GITHUB_REF_NAME=ciflow/trunk/133712 2024-08-20T22:19:10.8992444Z NV_CUDA_COMPAT_PACKAGE=cuda-compat-12-4 2024-08-20T22:19:10.8993099Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2024-08-20T22:19:10.8993668Z GITHUB_JOB=test 2024-08-20T22:19:10.8994069Z NV_LIBNCCL_PACKAGE=libnccl2=2.20.5-1+cuda12.4 2024-08-20T22:19:10.8994612Z LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64 2024-08-20T22:19:10.8995112Z NO_TEST_TIMEOUT=False 2024-08-20T22:19:10.8995431Z TD_DISTRIBUTED=False 2024-08-20T22:19:10.8995804Z NV_CUDA_NSIGHT_COMPUTE_VERSION=12.4.0-1 2024-08-20T22:19:10.8996232Z GITHUB_REPOSITORY=pytorch/pytorch 2024-08-20T22:19:10.8996640Z NV_NVPROF_VERSION=12.4.99-1 2024-08-20T22:19:10.8996998Z GITHUB_RETENTION_DAYS=90 2024-08-20T22:19:10.8997349Z OPENSSL_DIR=/opt/openssl 2024-08-20T22:19:10.8997690Z GITHUB_ACTION_REPOSITORY= 2024-08-20T22:19:10.8998708Z PATH=/opt/cache/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-08-20T22:19:10.8999745Z GITHUB_BASE_REF= 2024-08-20T22:19:10.9000168Z NV_LIBNCCL_PACKAGE_NAME=libnccl2 2024-08-20T22:19:10.9000534Z CI=true 2024-08-20T22:19:10.9000853Z NV_LIBNCCL_PACKAGE_VERSION=2.20.5-1 2024-08-20T22:19:10.9001261Z GITHUB_REPOSITORY_OWNER=pytorch 2024-08-20T22:19:10.9001630Z JOB_ID=29026448828 2024-08-20T22:19:10.9001941Z INSTALLED_PROTOBUF=yes 2024-08-20T22:19:10.9002262Z GITHUB_HEAD_REF= 2024-08-20T22:19:10.9002562Z GITHUB_ACTION_REF= 2024-08-20T22:19:10.9002996Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2024-08-20T22:19:10.9003449Z TEST_SHOWLOCALS=False 2024-08-20T22:19:10.9003775Z GITHUB_WORKFLOW=trunk 2024-08-20T22:19:10.9004117Z DEBIAN_FRONTEND=noninteractive 2024-08-20T22:19:10.9004993Z GITHUB_OUTPUT=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_output_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.9005784Z NO_TD=False 2024-08-20T22:19:10.9006087Z SKIP_SCCACHE_INITIALIZATION=1 2024-08-20T22:19:10.9006439Z _=/usr/bin/env 2024-08-20T22:19:10.9006914Z ++ python -c 'import site; print(site.getsitepackages()[0])' 2024-08-20T22:19:10.9162984Z + TORCH_INSTALL_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch 2024-08-20T22:19:10.9164166Z + TORCH_BIN_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T22:19:10.9165248Z + TORCH_LIB_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib 2024-08-20T22:19:10.9166117Z + TORCH_TEST_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/test 2024-08-20T22:19:10.9166710Z + BUILD_DIR=build 2024-08-20T22:19:10.9167026Z + BUILD_RENAMED_DIR=build_renamed 2024-08-20T22:19:10.9167408Z + BUILD_BIN_DIR=build/bin 2024-08-20T22:19:10.9168014Z + SHARD_NUMBER=1 2024-08-20T22:19:10.9168320Z + NUM_TEST_SHARDS=5 2024-08-20T22:19:10.9168640Z + export VALGRIND=ON 2024-08-20T22:19:10.9169010Z + VALGRIND=ON 2024-08-20T22:19:10.9169611Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *clang9* ]] 2024-08-20T22:19:10.9170094Z + [[ 0 == \1 ]] 2024-08-20T22:19:10.9170384Z + [[ False == \1 ]] 2024-08-20T22:19:10.9170849Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *bazel* ]] 2024-08-20T22:19:10.9171362Z ++ realpath build/custom_test_artifacts 2024-08-20T22:19:10.9181375Z + CUSTOM_TEST_ARTIFACT_BUILD_DIR=/var/lib/jenkins/workspace/build/custom_test_artifacts 2024-08-20T22:19:10.9182104Z + [[ -n '' ]] 2024-08-20T22:19:10.9182468Z + echo 'Environment variables' 2024-08-20T22:19:10.9182853Z Environment variables 2024-08-20T22:19:10.9183169Z + env 2024-08-20T22:19:10.9190115Z INSTALLED_DB=yes 2024-08-20T22:19:10.9190676Z NV_LIBCUBLAS_VERSION=12.4.2.65-1 2024-08-20T22:19:10.9191198Z NVIDIA_VISIBLE_DEVICES=all 2024-08-20T22:19:10.9191712Z NV_NVML_DEV_VERSION=12.4.99-1 2024-08-20T22:19:10.9192502Z GITHUB_WORKSPACE=/home/ec2-user/actions-runner/_work/pytorch/pytorch 2024-08-20T22:19:10.9193261Z CONTINUE_THROUGH_ERROR=False 2024-08-20T22:19:10.9194266Z NV_LIBNCCL_DEV_PACKAGE=libnccl-dev=2.20.5-1+cuda12.4 2024-08-20T22:19:10.9194894Z NV_LIBNCCL_DEV_PACKAGE_VERSION=2.20.5-1 2024-08-20T22:19:10.9195539Z BUILD_ENVIRONMENT=linux-focal-cuda12.4-py3.10-gcc9-sm86 2024-08-20T22:19:10.9196184Z HOSTNAME=00fa8332bfd4 2024-08-20T22:19:10.9197128Z GITHUB_PATH=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/add_path_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.9197959Z GITHUB_ACTION=__self 2024-08-20T22:19:10.9198297Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2024-08-20T22:19:10.9204411Z NVIDIA_REQUIRE_CUDA=cuda>=12.4 brand=tesla,driver>=470,driver<471 brand=unknown,driver>=470,driver<471 brand=nvidia,driver>=470,driver<471 brand=nvidiartx,driver>=470,driver<471 brand=geforce,driver>=470,driver<471 brand=geforcertx,driver>=470,driver<471 brand=quadro,driver>=470,driver<471 brand=quadrortx,driver>=470,driver<471 brand=titan,driver>=470,driver<471 brand=titanrtx,driver>=470,driver<471 brand=tesla,driver>=525,driver<526 brand=unknown,driver>=525,driver<526 brand=nvidia,driver>=525,driver<526 brand=nvidiartx,driver>=525,driver<526 brand=geforce,driver>=525,driver<526 brand=geforcertx,driver>=525,driver<526 brand=quadro,driver>=525,driver<526 brand=quadrortx,driver>=525,driver<526 brand=titan,driver>=525,driver<526 brand=titanrtx,driver>=525,driver<526 brand=tesla,driver>=535,driver<536 brand=unknown,driver>=535,driver<536 brand=nvidia,driver>=535,driver<536 brand=nvidiartx,driver>=535,driver<536 brand=geforce,driver>=535,driver<536 brand=geforcertx,driver>=535,driver<536 brand=quadro,driver>=535,driver<536 brand=quadrortx,driver>=535,driver<536 brand=titan,driver>=535,driver<536 brand=titanrtx,driver>=535,driver<536 2024-08-20T22:19:10.9210316Z NV_LIBCUBLAS_DEV_PACKAGE=libcublas-dev-12-4=12.4.2.65-1 2024-08-20T22:19:10.9210822Z NV_NVTX_VERSION=12.4.99-1 2024-08-20T22:19:10.9211162Z GITHUB_RUN_NUMBER=92245 2024-08-20T22:19:10.9211496Z TEST_CONFIG=default 2024-08-20T22:19:10.9211829Z GITHUB_REPOSITORY_OWNER_ID=21003710 2024-08-20T22:19:10.9212304Z TORCH_NVCC_FLAGS=-Xfatbin -compress-all 2024-08-20T22:19:10.9212771Z NV_CUDA_CUDART_DEV_VERSION=12.4.99-1 2024-08-20T22:19:10.9213215Z NV_LIBCUSPARSE_VERSION=12.3.0.142-1 2024-08-20T22:19:10.9213620Z SCRIBE_GRAPHQL_ACCESS_TOKEN= 2024-08-20T22:19:10.9214018Z NV_LIBNPP_VERSION=12.2.5.2-1 2024-08-20T22:19:10.9214459Z GITHUB_TRIGGERING_ACTOR=pytorch-bot[bot] 2024-08-20T22:19:10.9214999Z CMAKE_CUDA_COMPILER_LAUNCHER=/opt/cache/bin/sccache 2024-08-20T22:19:10.9215458Z GITHUB_REF_TYPE=tag 2024-08-20T22:19:10.9215779Z TORCH_CUDA_ARCH_LIST=Maxwell 2024-08-20T22:19:10.9216153Z NCCL_VERSION=2.20.5-1 2024-08-20T22:19:10.9216547Z BASE_SHA=40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:19:10.9216991Z XLA_CUDA= 2024-08-20T22:19:10.9217267Z HUGGING_FACE_HUB_TOKEN= 2024-08-20T22:19:10.9217661Z *** 2024-08-20T22:19:10.9217983Z CARGO_NET_GIT_FETCH_WITH_CLI=true 2024-08-20T22:19:10.9218382Z GITHUB_REPOSITORY_ID=65600975 2024-08-20T22:19:10.9218740Z GITHUB_ACTIONS=true 2024-08-20T22:19:10.9219084Z NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T22:19:10.9219579Z NV_NVPROF_DEV_PACKAGE=cuda-nvprof-12-4=12.4.99-1 2024-08-20T22:19:10.9220106Z NV_LIBNPP_PACKAGE=libnpp-12-4=12.2.5.2-1 2024-08-20T22:19:10.9220575Z SHA1=40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:19:10.9221093Z NV_LIBNCCL_DEV_PACKAGE_NAME=libnccl-dev 2024-08-20T22:19:10.9221576Z GITHUB_SHA=40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:19:10.9222447Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk.yml@refs/tags/ciflow/trunk/133712 2024-08-20T22:19:10.9223120Z UCC_HOME=/usr 2024-08-20T22:19:10.9223471Z NV_LIBCUBLAS_DEV_VERSION=12.4.2.65-1 2024-08-20T22:19:10.9223869Z VERBOSE_TEST_LOGS=False 2024-08-20T22:19:10.9224220Z NVIDIA_PRODUCT_NAME=CUDA 2024-08-20T22:19:10.9224689Z NV_LIBCUBLAS_DEV_PACKAGE_NAME=libcublas-dev-12-4 2024-08-20T22:19:10.9225172Z GITHUB_REF=refs/tags/ciflow/trunk/133712 2024-08-20T22:19:10.9225630Z NV_CUDA_CUDART_VERSION=12.4.99-1 2024-08-20T22:19:10.9226006Z SHARD_NUMBER=1 2024-08-20T22:19:10.9226406Z GITHUB_REF_PROTECTED=false 2024-08-20T22:19:10.9226765Z HOME=/var/lib/jenkins 2024-08-20T22:19:10.9227128Z GITHUB_API_URL=https://api.github.com 2024-08-20T22:19:10.9227561Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2024-08-20T22:19:10.9228038Z UCX_COMMIT=7bb2722ff2187a0cad557ae4a6afa090569f83fb 2024-08-20T22:19:10.9228519Z SCCACHE_S3_KEY_PREFIX=trunk 2024-08-20T22:19:10.9228871Z CUDA_VERSION=12.4.0 2024-08-20T22:19:10.9229314Z NV_LIBCUBLAS_PACKAGE=libcublas-12-4=12.4.2.65-1 2024-08-20T22:19:10.9229756Z NUM_TEST_SHARDS=5 2024-08-20T22:19:10.9230055Z UCX_HOME=/usr 2024-08-20T22:19:10.9230574Z NV_CUDA_NSIGHT_COMPUTE_DEV_PACKAGE=cuda-nsight-compute-12-4=12.4.0-1 2024-08-20T22:19:10.9231634Z GITHUB_STATE=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/save_state_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.9232888Z JOB_NAME=linux-focal-cuda12.4-py3.10-gcc9-sm86 / test (default, 1, 5, amz2023.linux.g5.4xlarge.nvidia.gpu) 2024-08-20T22:19:10.9234122Z GITHUB_ENV=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_env_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.9235223Z GITHUB_EVENT_PATH=/home/ec2-user/actions-runner/_work/_temp/_github_workflow/event.json 2024-08-20T22:19:10.9235855Z GITHUB_EVENT_NAME=push 2024-08-20T22:19:10.9236185Z DASHBOARD_TAG= 2024-08-20T22:19:10.9236490Z GITHUB_RUN_ID=10479310961 2024-08-20T22:19:10.9236944Z NV_LIBNPP_DEV_PACKAGE=libnpp-dev-12-4=12.2.5.2-1 2024-08-20T22:19:10.9237481Z NV_LIBCUBLAS_PACKAGE_NAME=libcublas-12-4 2024-08-20T22:19:10.9238458Z GITHUB_STEP_SUMMARY=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/step_summary_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.9239334Z GITHUB_ACTOR=pytorch-bot[bot] 2024-08-20T22:19:10.9239739Z NV_LIBNPP_DEV_VERSION=12.2.5.2-1 2024-08-20T22:19:10.9240198Z PR_NUMBER= 2024-08-20T22:19:10.9240471Z GITHUB_RUN_ATTEMPT=1 2024-08-20T22:19:10.9240782Z VALGRIND=ON 2024-08-20T22:19:10.9241076Z ANACONDA_PYTHON_VERSION=3.10 2024-08-20T22:19:10.9241507Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2024-08-20T22:19:10.9241970Z TERM=vt100 2024-08-20T22:19:10.9242308Z NV_LIBCUSPARSE_DEV_VERSION=12.3.0.142-1 2024-08-20T22:19:10.9242711Z INSTALLED_VISION=yes 2024-08-20T22:19:10.9243020Z BRANCH= 2024-08-20T22:19:10.9243304Z OPENSSL_ROOT_DIR=/opt/openssl 2024-08-20T22:19:10.9243691Z LIBRARY_PATH=/usr/local/cuda/lib64/stubs 2024-08-20T22:19:10.9244124Z CUDA_PATH=/usr/local/cuda 2024-08-20T22:19:10.9244907Z GITHUB_ACTION_PATH=/home/ec2-user/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-linux 2024-08-20T22:19:10.9245637Z GITHUB_SERVER_URL=https://github.com 2024-08-20T22:19:10.9246117Z UCC_COMMIT=20eae37090a4ce1b32bcce6144ccad0b49943e0b 2024-08-20T22:19:10.9246574Z REENABLED_ISSUES= 2024-08-20T22:19:10.9246858Z SHLVL=1 2024-08-20T22:19:10.9247121Z MAX_JOBS=14 2024-08-20T22:19:10.9247431Z NV_CUDA_LIB_VERSION=12.4.0-1 2024-08-20T22:19:10.9247770Z NVARCH=x86_64 2024-08-20T22:19:10.9248061Z GITHUB_ACTOR_ID=54816060 2024-08-20T22:19:10.9248515Z GITHUB_WORKFLOW_SHA=40ec5f6ddd9787aca0449b24128343ff4c4a88b3 2024-08-20T22:19:10.9249040Z GITHUB_REF_NAME=ciflow/trunk/133712 2024-08-20T22:19:10.9249521Z NV_CUDA_COMPAT_PACKAGE=cuda-compat-12-4 2024-08-20T22:19:10.9250185Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2024-08-20T22:19:10.9250745Z GITHUB_JOB=test 2024-08-20T22:19:10.9251151Z NV_LIBNCCL_PACKAGE=libnccl2=2.20.5-1+cuda12.4 2024-08-20T22:19:10.9251808Z LD_LIBRARY_PATH=/usr/local/nvidia/lib:/usr/local/nvidia/lib64 2024-08-20T22:19:10.9252322Z NO_TEST_TIMEOUT=False 2024-08-20T22:19:10.9252660Z TD_DISTRIBUTED=False 2024-08-20T22:19:10.9253051Z NV_CUDA_NSIGHT_COMPUTE_VERSION=12.4.0-1 2024-08-20T22:19:10.9253489Z GITHUB_REPOSITORY=pytorch/pytorch 2024-08-20T22:19:10.9253915Z NV_NVPROF_VERSION=12.4.99-1 2024-08-20T22:19:10.9254281Z GITHUB_RETENTION_DAYS=90 2024-08-20T22:19:10.9254680Z OPENSSL_DIR=/opt/openssl 2024-08-20T22:19:10.9255041Z GITHUB_ACTION_REPOSITORY= 2024-08-20T22:19:10.9256063Z PATH=/opt/cache/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-08-20T22:19:10.9257201Z GITHUB_BASE_REF= 2024-08-20T22:19:10.9257533Z NV_LIBNCCL_PACKAGE_NAME=libnccl2 2024-08-20T22:19:10.9257909Z CI=true 2024-08-20T22:19:10.9258233Z NV_LIBNCCL_PACKAGE_VERSION=2.20.5-1 2024-08-20T22:19:10.9258654Z GITHUB_REPOSITORY_OWNER=pytorch 2024-08-20T22:19:10.9259032Z JOB_ID=29026448828 2024-08-20T22:19:10.9259354Z INSTALLED_PROTOBUF=yes 2024-08-20T22:19:10.9259693Z GITHUB_HEAD_REF= 2024-08-20T22:19:10.9260005Z GITHUB_ACTION_REF= 2024-08-20T22:19:10.9260442Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2024-08-20T22:19:10.9260907Z TEST_SHOWLOCALS=False 2024-08-20T22:19:10.9261237Z GITHUB_WORKFLOW=trunk 2024-08-20T22:19:10.9261572Z DEBIAN_FRONTEND=noninteractive 2024-08-20T22:19:10.9262462Z GITHUB_OUTPUT=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_output_231d9783-5be2-44f4-807c-460822871ead 2024-08-20T22:19:10.9263256Z NO_TD=False 2024-08-20T22:19:10.9263557Z SKIP_SCCACHE_INITIALIZATION=1 2024-08-20T22:19:10.9263917Z _=/usr/bin/env 2024-08-20T22:19:10.9264264Z + echo 'Testing pytorch' 2024-08-20T22:19:10.9264598Z Testing pytorch 2024-08-20T22:19:10.9264922Z + export LANG=C.UTF-8 2024-08-20T22:19:10.9265262Z + LANG=C.UTF-8 2024-08-20T22:19:10.9265546Z + PR_NUMBER= 2024-08-20T22:19:10.9265855Z + [[ default == \d\e\f\a\u\l\t ]] 2024-08-20T22:19:10.9266263Z + export CUDA_VISIBLE_DEVICES=0 2024-08-20T22:19:10.9266640Z + CUDA_VISIBLE_DEVICES=0 2024-08-20T22:19:10.9266990Z + export HIP_VISIBLE_DEVICES=0 2024-08-20T22:19:10.9267362Z + HIP_VISIBLE_DEVICES=0 2024-08-20T22:19:10.9268025Z + [[ default == \d\i\s\t\r\i\b\u\t\e\d ]] 2024-08-20T22:19:10.9268555Z + [[ default == \s\l\o\w ]] 2024-08-20T22:19:10.9269281Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *slow-gradcheck* ]] 2024-08-20T22:19:10.9269971Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *cuda* ]] 2024-08-20T22:19:10.9270515Z + export PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2024-08-20T22:19:10.9270996Z + PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2024-08-20T22:19:10.9271409Z + [[ default == *crossref* ]] 2024-08-20T22:19:10.9271929Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *rocm* ]] 2024-08-20T22:19:10.9272564Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *xpu* ]] 2024-08-20T22:19:10.9273204Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *-bazel-* ]] 2024-08-20T22:19:10.9273769Z + pip_install --user ninja==1.10.2 2024-08-20T22:19:10.9274315Z + pip install --progress-bar off --user ninja==1.10.2 2024-08-20T22:19:12.2869163Z Collecting ninja==1.10.2 2024-08-20T22:19:12.3060149Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl.metadata (5.0 kB) 2024-08-20T22:19:12.3610907Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl (108 kB) 2024-08-20T22:19:12.7229196Z Installing collected packages: ninja 2024-08-20T22:19:12.7309332Z  WARNING: The script ninja is installed in '/var/lib/jenkins/.local/bin' which is not on PATH. 2024-08-20T22:19:12.7310600Z Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location. 2024-08-20T22:19:12.7813699Z Successfully installed ninja-1.10.2 2024-08-20T22:19:12.8485988Z + export PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-08-20T22:19:12.8488058Z + PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-08-20T22:19:12.8489606Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *aarch64* ]] 2024-08-20T22:19:12.8490102Z + install_tlparse 2024-08-20T22:19:12.8490487Z + pip_install --user tlparse==0.3.25 2024-08-20T22:19:12.8491033Z + pip install --progress-bar off --user tlparse==0.3.25 2024-08-20T22:19:13.2793660Z Collecting tlparse==0.3.25 2024-08-20T22:19:13.3221541Z Downloading tlparse-0.3.25-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (1.7 kB) 2024-08-20T22:19:13.3778073Z Downloading tlparse-0.3.25-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.2 MB) 2024-08-20T22:19:13.7892567Z Installing collected packages: tlparse 2024-08-20T22:19:13.8764640Z Successfully installed tlparse-0.3.25 2024-08-20T22:19:13.9480795Z ++ python -m site --user-base 2024-08-20T22:19:13.9725149Z + PATH=/var/lib/jenkins/.local/bin:/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-08-20T22:19:13.9726900Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *asan* ]] 2024-08-20T22:19:13.9727550Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *-debug* ]] 2024-08-20T22:19:13.9728201Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *-bazel-* ]] 2024-08-20T22:19:13.9729118Z + echo 'We are not in debug mode: linux-focal-cuda12.4-py3.10-gcc9-sm86. Expect the assertion to pass' 2024-08-20T22:19:13.9730403Z We are not in debug mode: linux-focal-cuda12.4-py3.10-gcc9-sm86. Expect the assertion to pass 2024-08-20T22:19:13.9731096Z + cd test 2024-08-20T22:19:13.9731611Z + python -c 'import torch; torch._C._crash_if_debug_asserts_fail(424242)' 2024-08-20T22:19:15.6100293Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]] 2024-08-20T22:19:15.6100850Z + [[ default == \n\o\g\p\u\_\A\V\X\5\1\2 ]] 2024-08-20T22:19:15.6104085Z + DYNAMO_BENCHMARK_FLAGS=() 2024-08-20T22:19:15.6105054Z + [[ default == *pr_time_benchmarks* ]] 2024-08-20T22:19:15.6105660Z + [[ default == *dynamo_eager* ]] 2024-08-20T22:19:15.6106195Z + [[ default == *aot_eager* ]] 2024-08-20T22:19:15.6106635Z + [[ default == *aot_inductor* ]] 2024-08-20T22:19:15.6107027Z + [[ default == *inductor* ]] 2024-08-20T22:19:15.6107396Z + [[ default == *dynamic* ]] 2024-08-20T22:19:15.6107749Z + [[ default == *cpu* ]] 2024-08-20T22:19:15.6108359Z + DYNAMO_BENCHMARK_FLAGS+=(--device cuda) 2024-08-20T22:19:15.6137646Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *libtorch* ]] 2024-08-20T22:19:15.6138423Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *-bazel-* ]] 2024-08-20T22:19:15.6140752Z + cd test 2024-08-20T22:19:15.6141357Z + python -c 'import torch; print(torch.__config__.show())' 2024-08-20T22:19:17.0889339Z PyTorch built with: 2024-08-20T22:19:17.0890037Z - GCC 9.4 2024-08-20T22:19:17.0890451Z - C++ Version: 201703 2024-08-20T22:19:17.0891305Z - Intel(R) oneAPI Math Kernel Library Version 2021.4-Product Build 20210904 for Intel(R) 64 architecture applications 2024-08-20T22:19:17.0892364Z - Intel(R) MKL-DNN v3.4.2 (Git Hash 1137e04ec0b5251ca2b4400a4fd3c667ce843d67) 2024-08-20T22:19:17.0893018Z - OpenMP 201511 (a.k.a. OpenMP 4.5) 2024-08-20T22:19:17.0893520Z - LAPACK is enabled (usually provided by MKL) 2024-08-20T22:19:17.0893996Z - NNPACK is enabled 2024-08-20T22:19:17.0894374Z - CPU capability usage: AVX2 2024-08-20T22:19:17.0894772Z - CUDA Runtime 12.4 2024-08-20T22:19:17.0895300Z - NVCC architecture flags: -gencode;arch=compute_86,code=sm_86 2024-08-20T22:19:17.0895846Z - CuDNN 90.1 2024-08-20T22:19:17.0896162Z - Magma 2.6.1 2024-08-20T22:19:17.0903172Z - Build settings: BLAS_INFO=mkl, BUILD_TYPE=Release, CUDA_VERSION=12.4, CUDNN_VERSION=9.1.0, CXX_COMPILER=/opt/cache/bin/c++, CXX_FLAGS= -D_GLIBCXX_USE_CXX11_ABI=1 -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOROCTRACER -DLIBKINETO_NOXPUPTI=ON -DUSE_FBGEMM -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE -O2 -fPIC -Wall -Wextra -Werror=return-type -Werror=non-virtual-dtor -Werror=bool-operation -Wnarrowing -Wno-missing-field-initializers -Wno-type-limits -Wno-array-bounds -Wno-unknown-pragmas -Wno-unused-parameter -Wno-strict-overflow -Wno-strict-aliasing -Wno-stringop-overflow -Wsuggest-override -Wno-psabi -Wno-error=old-style-cast -Wno-missing-braces -fdiagnostics-color=always -faligned-new -Werror -Wno-unused-but-set-variable -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, FORCE_FALLBACK_CUDA_MPI=1, LAPACK_INFO=mkl, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, PERF_WITH_AVX512=1, TORCH_VERSION=2.5.0, USE_CUDA=ON, USE_CUDNN=ON, USE_CUSPARSELT=ON, USE_EXCEPTION_PTR=1, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_GLOO=ON, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=ON, USE_NCCL=ON, USE_NNPACK=ON, USE_OPENMP=ON, USE_ROCM=OFF, USE_ROCM_KERNEL_ASSERT=OFF, 2024-08-20T22:19:17.0909501Z 2024-08-20T22:19:17.4080628Z + cd test 2024-08-20T22:19:17.4081358Z + python -c 'import torch; print(torch.__config__.parallel_info())' 2024-08-20T22:19:18.7472061Z ATen/Parallel: 2024-08-20T22:19:18.7472522Z at::get_num_threads() : 8 2024-08-20T22:19:18.7473009Z at::get_num_interop_threads() : 16 2024-08-20T22:19:18.7474195Z OpenMP 201511 (a.k.a. OpenMP 4.5) 2024-08-20T22:19:18.7474710Z omp_get_max_threads() : 8 2024-08-20T22:19:18.7476018Z Intel(R) oneAPI Math Kernel Library Version 2021.4-Product Build 20210904 for Intel(R) 64 architecture applications 2024-08-20T22:19:18.7476816Z mkl_get_max_threads() : 8 2024-08-20T22:19:18.7477437Z Intel(R) MKL-DNN v3.4.2 (Git Hash 1137e04ec0b5251ca2b4400a4fd3c667ce843d67) 2024-08-20T22:19:18.7478051Z std::thread::hardware_concurrency() : 16 2024-08-20T22:19:18.7478470Z Environment variables: 2024-08-20T22:19:18.7478815Z OMP_NUM_THREADS : [not set] 2024-08-20T22:19:18.7479173Z MKL_NUM_THREADS : [not set] 2024-08-20T22:19:18.7479549Z ATen parallel backend: OpenMP 2024-08-20T22:19:18.7479802Z 2024-08-20T22:19:19.0247217Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *aarch64* ]] 2024-08-20T22:19:19.0247917Z + [[ default == *backward* ]] 2024-08-20T22:19:19.0248393Z + [[ default == *xla* ]] 2024-08-20T22:19:19.0248769Z + [[ default == *executorch* ]] 2024-08-20T22:19:19.0249165Z + [[ default == \j\i\t\_\l\e\g\a\c\y ]] 2024-08-20T22:19:19.0249765Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *libtorch* ]] 2024-08-20T22:19:19.0250267Z + [[ default == distributed ]] 2024-08-20T22:19:19.0250664Z + [[ default == *inductor_distributed* ]] 2024-08-20T22:19:19.0251169Z + [[ default == *inductor-halide* ]] 2024-08-20T22:19:19.0251667Z + [[ default == *inductor-micro-benchmark* ]] 2024-08-20T22:19:19.0252115Z + [[ default == *huggingface* ]] 2024-08-20T22:19:19.0252492Z + [[ default == *timm* ]] 2024-08-20T22:19:19.0252903Z + [[ default == *torchbench* ]] 2024-08-20T22:19:19.0253369Z + [[ default == *inductor_cpp_wrapper_abi_compatible* ]] 2024-08-20T22:19:19.0253858Z + [[ default == *inductor* ]] 2024-08-20T22:19:19.0254216Z + [[ default == *dynamo* ]] 2024-08-20T22:19:19.0254722Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *rocm* ]] 2024-08-20T22:19:19.0255188Z + [[ 1 == 1 ]] 2024-08-20T22:19:19.0255508Z + [[ 5 -gt 1 ]] 2024-08-20T22:19:19.0255810Z + test_without_numpy 2024-08-20T22:19:19.0256143Z ++ dirname .ci/pytorch/test.sh 2024-08-20T22:19:19.0270457Z + pushd .ci/pytorch 2024-08-20T22:19:19.0270826Z ~/workspace/.ci/pytorch ~/workspace 2024-08-20T22:19:19.0272057Z + python -c 'import sys;sys.path.insert(0, '\''fake_numpy'\'');from unittest import TestCase;import torch;x=torch.randn(3,3);TestCase().assertRaises(RuntimeError, lambda: x.numpy())' 2024-08-20T22:19:19.9166239Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/functional_tensor.py:271: UserWarning: Failed to initialize NumPy: Sorry PyTorch, but our NumPy is in the other folder (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/utils/tensor_numpy.cpp:84.) 2024-08-20T22:19:19.9168716Z cpu = _conversion_method_template(device=torch.device("cpu")) 2024-08-20T22:19:20.5672971Z + python -c 'import sys;sys.path.insert(0, '\''fake_numpy'\'');import torch;print(torch.tensor([torch.tensor(0.), torch.tensor(1.)]))' 2024-08-20T22:19:21.4523313Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/functional_tensor.py:271: UserWarning: Failed to initialize NumPy: Sorry PyTorch, but our NumPy is in the other folder (Triggered internally at /var/lib/jenkins/workspace/torch/csrc/utils/tensor_numpy.cpp:84.) 2024-08-20T22:19:21.4525087Z cpu = _conversion_method_template(device=torch.device("cpu")) 2024-08-20T22:19:21.8443254Z tensor([0., 1.]) 2024-08-20T22:19:22.1022996Z + [[ default == *dynamo* ]] 2024-08-20T22:19:22.1023487Z + popd 2024-08-20T22:19:22.1023842Z ~/workspace 2024-08-20T22:19:22.1024216Z + install_torchvision 2024-08-20T22:19:22.1024541Z + local orig_preload 2024-08-20T22:19:22.1024855Z + local commit 2024-08-20T22:19:22.1029890Z ++ get_pinned_commit vision 2024-08-20T22:19:22.1030406Z ++ cat .github/ci_commit_pins/vision.txt 2024-08-20T22:19:22.1047465Z + commit=d23a6e1664d20707c11781299611436e1f0c104f 2024-08-20T22:19:22.1048055Z + orig_preload= 2024-08-20T22:19:22.1048529Z + '[' -n '' ']' 2024-08-20T22:19:22.1049356Z + pip_install --no-use-pep517 --user git+https://github.com/pytorch/vision.git@d23a6e1664d20707c11781299611436e1f0c104f 2024-08-20T22:19:22.1050745Z + pip install --progress-bar off --no-use-pep517 --user git+https://github.com/pytorch/vision.git@d23a6e1664d20707c11781299611436e1f0c104f 2024-08-20T22:19:22.4365194Z Collecting git+https://github.com/pytorch/vision.git@d23a6e1664d20707c11781299611436e1f0c104f 2024-08-20T22:19:22.4370422Z Cloning https://github.com/pytorch/vision.git (to revision d23a6e1664d20707c11781299611436e1f0c104f) to /tmp/pip-req-build-yo9ewcyp 2024-08-20T22:19:22.4401444Z Running command git clone --filter=blob:none --quiet https://github.com/pytorch/vision.git /tmp/pip-req-build-yo9ewcyp 2024-08-20T22:19:23.9297670Z Running command git rev-parse -q --verify 'sha^d23a6e1664d20707c11781299611436e1f0c104f' 2024-08-20T22:19:23.9324574Z Running command git fetch -q https://github.com/pytorch/vision.git d23a6e1664d20707c11781299611436e1f0c104f 2024-08-20T22:19:25.2569767Z Running command git checkout -q d23a6e1664d20707c11781299611436e1f0c104f 2024-08-20T22:19:25.5563469Z Resolved https://github.com/pytorch/vision.git to commit d23a6e1664d20707c11781299611436e1f0c104f 2024-08-20T22:19:28.0096406Z Preparing metadata (setup.py) ... [?25l- \ done 2024-08-20T22:19:28.0130647Z [?25hRequirement already satisfied: numpy in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torchvision==0.19.0a0+d23a6e1) (1.22.4) 2024-08-20T22:19:28.0133859Z Requirement already satisfied: torch in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torchvision==0.19.0a0+d23a6e1) (2.5.0a0+git40ec5f6) 2024-08-20T22:19:28.0138416Z Requirement already satisfied: pillow!=8.3.*,>=5.3.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torchvision==0.19.0a0+d23a6e1) (10.3.0) 2024-08-20T22:19:28.0211139Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (3.13.1) 2024-08-20T22:19:28.0214940Z Requirement already satisfied: typing-extensions>=4.8.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (4.12.2) 2024-08-20T22:19:28.0218517Z Requirement already satisfied: networkx in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (2.8.8) 2024-08-20T22:19:28.0222091Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (3.1.4) 2024-08-20T22:19:28.0225653Z Requirement already satisfied: fsspec in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (2024.6.1) 2024-08-20T22:19:28.0231646Z Requirement already satisfied: sympy==1.13.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (1.13.1) 2024-08-20T22:19:28.0246358Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy==1.13.1->torch->torchvision==0.19.0a0+d23a6e1) (1.3.0) 2024-08-20T22:19:28.0722107Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch->torchvision==0.19.0a0+d23a6e1) (2.1.5) 2024-08-20T22:19:28.0786169Z Building wheels for collected packages: torchvision 2024-08-20T22:20:43.2088869Z Building wheel for torchvision (setup.py) ... [?25l- \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2024-08-20T22:20:43.2122583Z [?25h Created wheel for torchvision: filename=torchvision-0.19.0a0+d23a6e1-cp310-cp310-linux_x86_64.whl size=2024877 sha256=fb8d31c8fb563e9ef745c84ba6917133eb147a5aa2da90ac5d596173c540d332 2024-08-20T22:20:43.2125725Z Stored in directory: /var/lib/jenkins/.cache/pip/wheels/0e/56/35/02931e71eb23fd2b85591c7ec05b733ca7c8b328a2fd151f96 2024-08-20T22:20:43.2155094Z Successfully built torchvision 2024-08-20T22:20:43.4608103Z Installing collected packages: torchvision 2024-08-20T22:20:43.8801124Z Successfully installed torchvision-0.19.0a0+d23a6e1 2024-08-20T22:20:44.0109401Z + '[' -n '' ']' 2024-08-20T22:20:44.0111509Z + test_python_shard 1 2024-08-20T22:20:44.0112103Z + [[ -z 5 ]] 2024-08-20T22:20:44.0112811Z + python test/run_test.py --exclude-jit-executor --exclude-distributed-tests --shard 1 5 --verbose 2024-08-20T22:20:44.1083651Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T22:20:44.1084732Z import pkg_resources 2024-08-20T22:20:47.7392892Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T22:20:47.8320484Z Ignoring disabled issues: [''] 2024-08-20T22:20:47.8429028Z Found test times from artifacts 2024-08-20T22:20:47.8858429Z Found test times from artifacts 2024-08-20T22:20:47.8873403Z Running 25% of tests based on TD 2024-08-20T22:20:47.9177982Z Running parallel tests on 3 processes 2024-08-20T22:20:47.9180455Z Name: tests to run (est. time: 39.71min) 2024-08-20T22:20:47.9181022Z Serial tests (4): 2024-08-20T22:20:47.9181453Z inductor/test_max_autotune 1/1 2024-08-20T22:20:47.9181976Z inductor/test_distributed_patterns 1/1 2024-08-20T22:20:47.9184298Z test_utils 1/1 2024-08-20T22:20:47.9184814Z test_nn 1/1 2024-08-20T22:20:47.9185296Z Parallel tests (22): 2024-08-20T22:20:47.9185894Z inductor/test_torchinductor_opinfo 3/13 2024-08-20T22:20:47.9186680Z inductor/test_torchinductor_dynamic_shapes 4/6 2024-08-20T22:20:47.9191536Z inductor/test_torchinductor_dynamic_shapes 5/6 2024-08-20T22:20:47.9192292Z inductor/test_torchinductor_dynamic_shapes 6/6 2024-08-20T22:20:47.9192798Z inductor/test_mmdecomp 1/1 2024-08-20T22:20:47.9193288Z dynamo/test_interop 1/1 2024-08-20T22:20:47.9193752Z dynamo/test_logging 1/1 2024-08-20T22:20:47.9194193Z dynamo/test_exc 1/1 2024-08-20T22:20:47.9194618Z dynamo/test_global 1/1 2024-08-20T22:20:47.9196527Z dynamo/test_unspec 1/1 2024-08-20T22:20:47.9197085Z inductor/test_cudagraph_trees 1/1 2024-08-20T22:20:47.9197662Z dynamo/test_ctx_manager 1/1 2024-08-20T22:20:47.9198145Z dynamo/test_subgraphs 1/1 2024-08-20T22:20:47.9198646Z inductor/test_pattern_matcher 1/1 2024-08-20T22:20:47.9199177Z dynamo/test_autograd_function 1/1 2024-08-20T22:20:47.9199832Z dynamo/test_activation_checkpointing 1/1 2024-08-20T22:20:47.9200430Z inductor/test_inductor_freezing 1/1 2024-08-20T22:20:47.9200894Z inductor/test_mkldnn_pattern_matcher 1/1 2024-08-20T22:20:47.9201322Z inductor/test_aot_inductor 6/16 2024-08-20T22:20:47.9202021Z inductor/test_aot_inductor 7/16 2024-08-20T22:20:47.9202432Z inductor/test_aot_inductor 14/16 2024-08-20T22:20:47.9202843Z inductor/test_cpu_cpp_wrapper 1/1 2024-08-20T22:20:47.9203262Z Name: excluded (est. time: 45.63min) 2024-08-20T22:20:47.9203653Z Serial tests (42): 2024-08-20T22:20:47.9203985Z inductor/test_flex_attention 1/2 2024-08-20T22:20:47.9204391Z inductor/test_flex_attention 2/2 2024-08-20T22:20:47.9204770Z test_fx 1/1 2024-08-20T22:20:47.9205061Z test_reductions 1/1 2024-08-20T22:20:47.9205403Z test_multiprocessing 1/1 2024-08-20T22:20:47.9206687Z test_torch 1/1 2024-08-20T22:20:47.9207010Z test_tensorexpr 1/1 2024-08-20T22:20:47.9207382Z inductor/test_benchmark_fusion 1/1 2024-08-20T22:20:47.9207789Z test_tensor_creation_ops 1/1 2024-08-20T22:20:47.9208157Z test_cpp_extensions_jit 1/1 2024-08-20T22:20:47.9208523Z nn/test_convolution 1/1 2024-08-20T22:20:47.9208896Z distributions/test_distributions 1/1 2024-08-20T22:20:47.9209322Z inductor/test_cutlass_backend 1/1 2024-08-20T22:20:47.9209707Z test_dispatch 1/1 2024-08-20T22:20:47.9210042Z test_multiprocessing_spawn 1/1 2024-08-20T22:20:47.9210419Z test_spectral_ops 1/1 2024-08-20T22:20:47.9210753Z test_fake_tensor 1/1 2024-08-20T22:20:47.9211086Z test_cpp_api_parity 1/1 2024-08-20T22:20:47.9211499Z test_cpp_extensions_open_device_registration 1/1 2024-08-20T22:20:47.9211999Z functorch/test_memory_efficient_fusion 1/1 2024-08-20T22:20:47.9212425Z nn/test_pooling 1/1 2024-08-20T22:20:47.9212746Z test_sort_and_select 1/1 2024-08-20T22:20:47.9213109Z test_mobile_optimizer 1/1 2024-08-20T22:20:47.9213462Z test_cuda_trace 1/1 2024-08-20T22:20:47.9213775Z test_overrides 1/1 2024-08-20T22:20:47.9214110Z test_namedtuple_return_api 1/1 2024-08-20T22:20:47.9214485Z test_autocast 1/1 2024-08-20T22:20:47.9214801Z test_python_dispatch 1/1 2024-08-20T22:20:47.9215147Z test_native_mha 1/1 2024-08-20T22:20:47.9215498Z test_cpp_extensions_aot_ninja 1/1 2024-08-20T22:20:47.9215908Z test_cpp_extensions_aot_no_ninja 1/1 2024-08-20T22:20:47.9216314Z test_autograd_fallback 1/1 2024-08-20T22:20:47.9216683Z test_cuda_nvml_based_avail 1/1 2024-08-20T22:20:47.9217049Z test_jit_disabled 1/1 2024-08-20T22:20:47.9217381Z test_show_pickle 1/1 2024-08-20T22:20:47.9217718Z test_cuda_primary_ctx 1/1 2024-08-20T22:20:47.9218088Z test_cpp_extensions_mtia_backend 1/1 2024-08-20T22:20:47.9218525Z test_cpp_extensions_stream_and_event 1/1 2024-08-20T22:20:47.9218954Z test_ci_sanity_check_fail 1/1 2024-08-20T22:20:47.9219313Z doctests 1/1 2024-08-20T22:20:47.9219615Z test_autoload_disable 1/1 2024-08-20T22:20:47.9219970Z test_autoload_enable 1/1 2024-08-20T22:20:47.9220312Z Parallel tests (46): 2024-08-20T22:20:47.9220663Z inductor/test_cuda_cpp_wrapper 4/5 2024-08-20T22:20:47.9221059Z test_matmul_cuda 1/1 2024-08-20T22:20:47.9221396Z functorch/test_control_flow 1/1 2024-08-20T22:20:47.9221777Z test_cuda 1/1 2024-08-20T22:20:47.9222098Z test_cuda_expandable_segments 1/1 2024-08-20T22:20:47.9222496Z inductor/test_multi_kernel 1/1 2024-08-20T22:20:47.9222901Z inductor/test_aot_inductor_package 1/1 2024-08-20T22:20:47.9223333Z functorch/test_aotdispatch 1/1 2024-08-20T22:20:47.9223746Z inductor/test_decompose_mem_bound_mm 1/1 2024-08-20T22:20:47.9224162Z test_typing 1/1 2024-08-20T22:20:47.9224475Z test_mkldnn_fusion 1/1 2024-08-20T22:20:47.9224801Z test_testing 1/1 2024-08-20T22:20:47.9225125Z profiler/test_profiler 1/1 2024-08-20T22:20:47.9225509Z test_sparse_semi_structured 1/1 2024-08-20T22:20:47.9225888Z test_jit_autocast 1/1 2024-08-20T22:20:47.9226214Z test_masked 1/1 2024-08-20T22:20:47.9226534Z inductor/test_memory_planning 1/1 2024-08-20T22:20:47.9226947Z dynamo/test_aot_autograd_cache 1/1 2024-08-20T22:20:47.9227345Z functorch/test_dims 1/1 2024-08-20T22:20:47.9227690Z inductor/test_b2b_gemm 1/1 2024-08-20T22:20:47.9228153Z test_scatter_gather_ops 1/1 2024-08-20T22:20:47.9228525Z dynamo/test_hooks 1/1 2024-08-20T22:20:47.9228917Z torch_np/numpy_tests/core/test_multiarray 1/1 2024-08-20T22:20:47.9229372Z export/test_converter 1/1 2024-08-20T22:20:47.9229748Z inductor/test_combo_kernels 1/1 2024-08-20T22:20:47.9230132Z test_tensorboard 1/1 2024-08-20T22:20:47.9230489Z export/test_export_nonstrict 1/1 2024-08-20T22:20:47.9230942Z inductor/test_torchinductor_strided_blocks 1/1 2024-08-20T22:20:47.9231395Z test_jiterator 1/1 2024-08-20T22:20:47.9231722Z test_expanded_weights 1/1 2024-08-20T22:20:47.9232201Z inductor/test_debug_trace 1/1 2024-08-20T22:20:47.9232613Z dynamo/test_backward_higher_order_ops 1/1 2024-08-20T22:20:47.9233039Z test_datapipe 1/1 2024-08-20T22:20:47.9233378Z inductor/test_autoheuristic 1/1 2024-08-20T22:20:47.9233787Z functorch/test_eager_transforms 1/1 2024-08-20T22:20:47.9234192Z test_custom_ops 1/1 2024-08-20T22:20:47.9234534Z test_type_promotion 1/1 2024-08-20T22:20:47.9234904Z dynamo/test_input_attr_tracking 1/1 2024-08-20T22:20:47.9235319Z inductor/test_torchbind 1/1 2024-08-20T22:20:47.9235710Z inductor/test_compile_worker 1/1 2024-08-20T22:20:47.9236140Z torch_np/numpy_tests/fft/test_helper 1/1 2024-08-20T22:20:47.9236567Z test_stateless 1/1 2024-08-20T22:20:47.9236891Z test_mkl_verbose 1/1 2024-08-20T22:20:47.9237245Z inductor/test_aot_inductor_utils 1/1 2024-08-20T22:20:47.9237642Z test_hub 1/1 2024-08-20T22:20:47.9237948Z optim/test_swa_utils 1/1 2024-08-20T22:20:47.9251406Z Running inductor/test_max_autotune 1/1 ... [2024-08-20 22:20:47.924748] 2024-08-20T22:20:47.9252049Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:20:47.9255655Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_max_autotune.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:20:47.925105] 2024-08-20T22:24:55.0246059Z 2024-08-20T22:24:55.0247518Z inductor/test_max_autotune 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_max_autotune_1.1_90cbe4a42765cb97_.log 2024-08-20T22:24:55.0269125Z Running 42 items in this shard: test/inductor/test_max_autotune.py::TestMaxAutotune::test_autotune_conv1x1, test/inductor/test_max_autotune.py::TestMaxAutotune::test_autotune_device_guard, test/inductor/test_max_autotune.py::TestMaxAutotune::test_benchmark_choice_fail_in_subproc, test/inductor/test_max_autotune.py::TestMaxAutotune::test_benchmark_choice_in_subproc, test/inductor/test_max_autotune.py::TestMaxAutotune::test_cat_addmm, test/inductor/test_max_autotune.py::TestMaxAutotune::test_conv1x1_with_free_symbols, test/inductor/test_max_autotune.py::TestMaxAutotune::test_conv3d, test/inductor/test_max_autotune.py::TestMaxAutotune::test_conv_backend, test/inductor/test_max_autotune.py::TestMaxAutotune::test_empty_conv_input, test/inductor/test_max_autotune.py::TestMaxAutotune::test_empty_conv_input_with_1x1_kernel, test/inductor/test_max_autotune.py::TestMaxAutotune::test_filled_cache_precompile, test/inductor/test_max_autotune.py::TestMaxAutotune::test_inf_timing_multi_template_False, test/inductor/test_max_autotune.py::TestMaxAutotune::test_inf_timing_multi_template_True, test/inductor/test_max_autotune.py::TestMaxAutotune::test_jit_fusion_matches_aot_fusion, test/inductor/test_max_autotune.py::TestMaxAutotune::test_matmul_dropout_device_cpu, test/inductor/test_max_autotune.py::TestMaxAutotune::test_matmul_dropout_device_cuda, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_addmm_dynamic_False, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_addmm_dynamic_True, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_addmm_zero_size_input_dynamic_False, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_addmm_zero_size_input_dynamic_True, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_mm_plus_mm_autotune_in_subproc_False_autotune_multi_device_False, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_mm_plus_mm_autotune_in_subproc_False_autotune_multi_device_True, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_mm_plus_mm_autotune_in_subproc_True_autotune_multi_device_False, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_mm_plus_mm_autotune_in_subproc_True_autotune_multi_device_True, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_mm_plus_mm_zero_size_input_dynamic_False, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_mm_plus_mm_zero_size_input_dynamic_True, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_regular_mm_dynamic_False, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_regular_mm_dynamic_True, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_regular_mm_zero_size_input_dynamic_False, test/inductor/test_max_autotune.py::TestMaxAutotune::test_max_autotune_regular_mm_zero_size_input_dynamic_True, test/inductor/test_max_autotune.py::TestMaxAutotune::test_no_valid_choices, test/inductor/test_max_autotune.py::TestMaxAutotune::test_non_contiguous_input_addmm, test/inductor/test_max_autotune.py::TestMaxAutotune::test_non_contiguous_input_bmm, test/inductor/test_max_autotune.py::TestMaxAutotune::test_non_contiguous_input_mm, test/inductor/test_max_autotune.py::TestMaxAutotune::test_non_contiguous_input_mm_plus_mm, test/inductor/test_max_autotune.py::TestMaxAutotune::test_precompilation_threads, test/inductor/test_max_autotune.py::TestMaxAutotune::test_precompilations, test/inductor/test_max_autotune.py::TestMaxAutotune::test_triton_template_with_epilogues_and_dynamic_shape, test/inductor/test_max_autotune.py::TestMaxAutotuneRemoteCache::test_max_autotune_remote_caching_dynamic_False, test/inductor/test_max_autotune.py::TestMaxAutotuneRemoteCache::test_max_autotune_remote_caching_dynamic_True, test/inductor/test_max_autotune.py::TestTuningProcess::test_tuning_pool_crash, test/inductor/test_max_autotune.py::TestTuningProcess::test_tuning_pool_multiple_devices 2024-08-20T22:24:55.0287254Z 2024-08-20T22:24:55.0287849Z Running inductor/test_distributed_patterns 1/1 ... [2024-08-20 22:24:55.024484] 2024-08-20T22:24:55.0288475Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:24:55.0290109Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_distributed_patterns.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:24:55.024875] 2024-08-20T22:25:36.1182947Z 2024-08-20T22:25:36.1184598Z inductor/test_distributed_patterns 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_distributed_patterns_1.1_bf928dc8d1ea8e85_.log 2024-08-20T22:25:36.1200754Z Running 19 items in this shard: test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_fake_distributed_aot_eager, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_fake_distributed_inductor, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_intermediate_hook_with_closure, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_module_backward_hooks_aot, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_module_backward_hooks_eager, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_module_backward_hooks_inductor, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_module_backward_hooks_multi_layers, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_nn_param_return1, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_nn_param_return2, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_nn_param_return3, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_nn_param_return4, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_storage_resize_nonzero_cpu, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_storage_resize_nonzero_gpu, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_storage_resize_zero_cpu, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_storage_resize_zero_gpu, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_unsafe_preserve_version_counter1, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_unsafe_preserve_version_counter2, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_unsafe_set_version_counter1, test/inductor/test_distributed_patterns.py::DistributedPatternTests::test_unsafe_set_version_counter2 2024-08-20T22:25:36.1210084Z 2024-08-20T22:25:36.1210418Z Running test_utils 1/1 ... [2024-08-20 22:25:36.118252] 2024-08-20T22:25:36.1210914Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:25:36.1212412Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_utils.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:25:36.118717] 2024-08-20T22:26:16.1072343Z 2024-08-20T22:26:16.1073873Z test_utils 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_utils_1.1_49f3548ae768572f_.log 2024-08-20T22:26:16.3704947Z Running 5927 items in this shard: test/test_utils.py::TestCheckpoint::test_checkpoint, test/test_utils.py::TestCheckpoint::test_checkpoint_module_list, test/test_utils.py::TestCheckpoint::test_checkpoint_no_tensors, test/test_utils.py::TestCheckpoint::test_checkpoint_non_tensor, test/test_utils.py::TestCheckpoint::test_checkpoint_non_tensor_inputs_outputs, test/test_utils.py::TestCheckpoint::test_checkpoint_not_preserve_rng_state_and_without_reentrant, test/test_utils.py::TestCheckpoint::test_checkpoint_partial_grad, test/test_utils.py::TestCheckpoint::test_checkpoint_rng_cpu, test/test_utils.py::TestCheckpoint::test_checkpoint_rng_cuda, test/test_utils.py::TestCheckpoint::test_checkpoint_sequential_deprecated_multiple_args, test/test_utils.py::TestCheckpoint::test_checkpoint_sequential_deprecated_no_args, test/test_utils.py::TestCheckpoint::test_checkpoint_trigger, test/test_utils.py::TestCheckpoint::test_checkpoint_valid, test/test_utils.py::TestCheckpoint::test_checkpointing_without_reentrant_early_free, test/test_utils.py::TestCheckpoint::test_get_device_states_recursive, test/test_utils.py::TestCheckpoint::test_infer_device_state_recursive_meta, test/test_utils.py::TestCheckpoint::test_infer_device_state_recursive_multi_cuda, test/test_utils.py::TestDataLoaderUtils::test_multi_drop, test/test_utils.py::TestDataLoaderUtils::test_multi_keep, test/test_utils.py::TestDataLoaderUtils::test_random_seed, test/test_utils.py::TestDataLoaderUtils::test_single_drop, test/test_utils.py::TestDataLoaderUtils::test_single_keep, test/test_utils.py::TestBottleneck::test_bottleneck_cpu_only, test/test_utils.py::TestBottleneck::test_bottleneck_cuda, test/test_utils.py::TestCollectEnv::test_smoke, test/test_utils.py::TestONNXUtils::test_check_onnx_broadcast, test/test_utils.py::TestONNXUtils::test_prepare_onnx_paddings, test/test_utils.py::TestHipify::test_import_hipify, test/test_utils.py::TestHipifyTrie::test_add_and_search_trie, test/test_utils.py::TestHipifyTrie::test_add_multiple_and_search_trie, test/test_utils.py::TestHipifyTrie::test_char_export_trie_to_regex, test/test_utils.py::TestHipifyTrie::test_export_trie_to_regex, test/test_utils.py::TestHipifyTrie::test_prefix_words_export_trie_to_regex, test/test_utils.py::TestHipifyTrie::test_quote_escape, test/test_utils.py::TestHipifyTrie::test_single_export_trie_to_regex, test/test_utils.py::TestHipifyTrie::test_special_char_export_trie_to_regex, test/test_utils.py::TestAssert::test_assert_scriptable, test/test_utils.py::TestAssert::test_assert_true, test/test_utils.py::TestStandaloneCPPJIT::test_load_standalone, test/test_utils.py::TestExtensionUtils::test_external_module_register, test/test_utils.py::TestExtensionUtils::test_external_module_register_with_renamed_backend, test/test_utils.py::TestRenderUtils::test_basic, test/test_utils.py::TestDeviceUtilsCUDA::test_basic_cuda, test/test_utils.py::TestDeviceUtilsCUDA::test_decorator_cuda, test/test_utils.py::TestDeviceUtilsCUDA::test_decorator_generator_cuda, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_H_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_T_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___getitem___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___radd___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rand___cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rand___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rand___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rand___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rand___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rand___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rdiv___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmatmul___cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmatmul___cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmatmul___cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmatmul___cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmatmul___cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmatmul___cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmod___cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmod___cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmod___cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmod___cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmod___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmod___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmod___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmod___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmod___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rmul___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___ror___cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___ror___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___ror___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___ror___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___ror___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___ror___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rpow___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rsub___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rxor___cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rxor___cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rxor___cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rxor___cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rxor___cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops___rxor___cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__batch_norm_with_update_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__batch_norm_with_update_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__batch_norm_with_update_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__batch_norm_with_update_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__chunk_cat_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__native_batch_norm_legit_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__native_batch_norm_legit_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__native_batch_norm_legit_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__native_batch_norm_legit_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__segment_reduce_lengths_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__segment_reduce_lengths_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__segment_reduce_lengths_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__segment_reduce_lengths_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__segment_reduce_offsets_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__segment_reduce_offsets_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__segment_reduce_offsets_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__segment_reduce_offsets_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__softmax_backward_data_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__softmax_backward_data_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__softmax_backward_data_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__softmax_backward_data_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__upsample_bilinear2d_aa_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__upsample_bilinear2d_aa_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__upsample_bilinear2d_aa_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops__upsample_bilinear2d_aa_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_abs_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acos_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_acosh_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_add_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addbmm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addbmm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addbmm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addbmm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addbmm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addbmm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcdiv_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcdiv_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcdiv_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcdiv_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcdiv_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcdiv_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addcmul_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_decomposed_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_decomposed_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_decomposed_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_decomposed_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_decomposed_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmm_decomposed_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmv_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmv_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmv_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmv_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmv_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addmv_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_addr_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_alias_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_all_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_allclose_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_allclose_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_allclose_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_allclose_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_allclose_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_allclose_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amax_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_amin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_aminmax_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_angle_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_any_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_arange_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_arange_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_arange_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_arange_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_arange_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_arange_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_arange_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_arange_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_arange_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmax_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmax_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmax_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmax_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmax_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argmin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argsort_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argsort_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argsort_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argsort_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argsort_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argsort_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argsort_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argsort_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argsort_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_argwhere_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_partial_views_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_as_strided_scatter_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_asinh_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atan_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atanh_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_1d_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_2d_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_atleast_3d_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_baddbmm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_baddbmm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_baddbmm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_baddbmm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_baddbmm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_baddbmm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bernoulli_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bernoulli_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bernoulli_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bernoulli_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bfloat16_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bincount_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bincount_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bincount_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bincount_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bincount_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_and_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_and_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_and_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_and_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_and_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_and_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_left_shift_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_left_shift_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_left_shift_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_left_shift_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_left_shift_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_not_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_not_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_not_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_not_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_not_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_not_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_or_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_or_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_or_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_or_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_or_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_or_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_right_shift_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_right_shift_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_right_shift_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_right_shift_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_right_shift_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_xor_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_xor_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_xor_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_xor_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_xor_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bitwise_xor_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_block_diag_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bmm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bmm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bmm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bmm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bmm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bmm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bool_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_shapes_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_tensors_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_broadcast_to_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bucketize_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bucketize_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bucketize_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bucketize_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bucketize_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bucketize_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bucketize_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bucketize_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_bucketize_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_byte_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cartesian_prod_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cat_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cauchy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cauchy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cauchy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cauchy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdist_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdist_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cdouble_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ceil_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ceil_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ceil_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ceil_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ceil_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ceil_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ceil_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ceil_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ceil_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cfloat_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chalf_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_char_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_inverse_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_inverse_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_inverse_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_inverse_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_solve_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_solve_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_solve_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cholesky_solve_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_chunk_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_max_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clamp_min_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_clone_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_column_stack_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_combinations_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_complex_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_complex_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_complex_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_conj_physical_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_constant_pad_nd_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_contiguous_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_copysign_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_corrcoef_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cos_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cosh_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_count_nonzero_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cov_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cross_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummax_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cummin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumprod_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumsum_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_cumulative_trapezoid_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_deg2rad_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diag_embed_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagflat_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diagonal_scatter_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_diff_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_digamma_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dist_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dist_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dist_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dist_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dist_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dist_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_floor_rounding_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_floor_rounding_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_floor_rounding_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_floor_rounding_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_floor_rounding_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_floor_rounding_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_floor_rounding_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_floor_rounding_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_floor_rounding_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_no_rounding_mode_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_trunc_rounding_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_trunc_rounding_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_trunc_rounding_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_trunc_rounding_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_trunc_rounding_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_trunc_rounding_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_trunc_rounding_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_trunc_rounding_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_div_trunc_rounding_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dot_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dot_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dot_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dot_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dot_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dot_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_double_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dsplit_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_dstack_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_einsum_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_einsum_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_einsum_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_einsum_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_einsum_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_einsum_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_like_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_permuted_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_empty_strided_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eq_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_equal_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erf_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfc_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_erfinv_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exp_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_as_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expand_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_expm1_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exponential_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exponential_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exponential_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_exponential_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_eye_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fft_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftn_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_fftshift_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfft_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_hfftn_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifft_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftn_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ifftshift_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfft_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfftn_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfftn_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfftn_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfftn_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfftn_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfftn_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfftn_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfftn_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_ihfftn_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfft_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_irfftn_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfft_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfftn_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfftn_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfftn_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfftn_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfftn_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfftn_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfftn_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfftn_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fft_rfftn_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fill_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flatten_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flip_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fliplr_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_flipud_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_float_power_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_divide_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_divide_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_divide_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_divide_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_divide_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_divide_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_divide_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_divide_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_floor_divide_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmax_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmod_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmod_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmod_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmod_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmod_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmod_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmod_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmod_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_fmod_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_frac_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_frac_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_frac_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_frac_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_frexp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_frexp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_frexp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_frexp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_full_like_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gather_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gcd_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gcd_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gcd_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gcd_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gcd_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ge_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geometric_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geometric_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geometric_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geometric_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geometric_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geometric_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geometric_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geometric_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geometric_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geqrf_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geqrf_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geqrf_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_geqrf_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gradient_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_grid_sampler_2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_grid_sampler_2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_grid_sampler_2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_grid_sampler_2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_gt_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_half_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_heaviside_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_histc_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_histc_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_histc_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_histc_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_histc_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_histc_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hsplit_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hstack_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hypot_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hypot_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hypot_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_hypot_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_i0_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_igamma_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_igamma_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_igammac_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_igammac_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_imag_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_imag_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_imag_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_add_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_fill_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_put_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amax_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amax_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amax_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amax_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amax_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_amin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_mean_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_mean_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_mean_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_mean_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_mean_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_mean_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_mean_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_mean_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_mean_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_prod_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_prod_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_prod_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_prod_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_prod_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_prod_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_prod_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_prod_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_reduce_prod_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_index_select_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_inner_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_inner_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_inner_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_inner_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_inner_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_inner_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_int_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isclose_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isfinite_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isinf_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isnan_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isneginf_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isposinf_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_isreal_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_istft_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_istft_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_item_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_2inputs_2outputs_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_binary_return_by_ref_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_jiterator_unary_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kron_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kthvalue_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kthvalue_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kthvalue_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kthvalue_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kthvalue_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kthvalue_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kthvalue_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kthvalue_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_kthvalue_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lcm_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lcm_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lcm_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lcm_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lcm_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ldexp_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_le_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lerp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lerp_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lerp_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lerp_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lerp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lerp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lerp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lgamma_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cholesky_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cholesky_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cholesky_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cholesky_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cholesky_ex_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cholesky_ex_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cholesky_ex_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cholesky_ex_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cond_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cond_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cond_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cond_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_cross_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_det_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_det_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_det_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_det_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_det_singular_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_det_singular_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_det_singular_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_det_singular_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_diagonal_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eig_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eig_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eig_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eig_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigh_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigh_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigh_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigh_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigvals_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigvals_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigvals_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigvals_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigvalsh_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigvalsh_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigvalsh_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_eigvalsh_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_householder_product_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_householder_product_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_householder_product_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_householder_product_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_inv_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_inv_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_inv_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_inv_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_inv_ex_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_inv_ex_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_inv_ex_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_inv_ex_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_factor_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_factor_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_factor_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_factor_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_factor_ex_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_factor_ex_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_factor_ex_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_factor_ex_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_solve_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_solve_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_solve_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_ldl_solve_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lstsq_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lstsq_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lstsq_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lstsq_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lstsq_grad_oriented_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lstsq_grad_oriented_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lstsq_grad_oriented_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lstsq_grad_oriented_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_factor_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_factor_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_factor_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_factor_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_factor_ex_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_factor_ex_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_factor_ex_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_factor_ex_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_solve_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_solve_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_solve_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_lu_solve_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_norm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_norm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_power_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_power_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_power_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_power_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_rank_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_rank_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_rank_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_rank_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_rank_hermitian_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_rank_hermitian_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_rank_hermitian_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_matrix_rank_hermitian_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_multi_dot_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_multi_dot_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_multi_dot_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_multi_dot_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_multi_dot_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_multi_dot_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_subgradients_at_zero_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_subgradients_at_zero_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_subgradients_at_zero_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_norm_subgradients_at_zero_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_hermitian_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_hermitian_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_hermitian_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_hermitian_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_singular_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_singular_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_singular_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_pinv_singular_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_qr_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_qr_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_qr_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_qr_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_slogdet_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_slogdet_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_slogdet_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_slogdet_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_ex_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_ex_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_ex_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_ex_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_triangular_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_triangular_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_triangular_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_solve_triangular_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_svd_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_svd_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_svd_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_svd_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_svdvals_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_svdvals_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_svdvals_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_svdvals_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_tensorinv_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_tensorinv_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_tensorinv_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_tensorinv_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_tensorsolve_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_tensorsolve_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_tensorsolve_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_tensorsolve_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vander_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vander_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vander_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vander_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vander_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vander_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vander_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vander_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vander_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vecdot_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vecdot_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vecdot_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vecdot_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vecdot_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vecdot_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vector_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vector_norm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vector_norm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vector_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vector_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linalg_vector_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_linspace_tensor_overload_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log10_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log1p_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_normal_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_normal_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_normal_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_normal_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_log_softmax_with_dtype_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logaddexp2_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logaddexp2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logaddexp2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logaddexp2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logaddexp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logaddexp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logaddexp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logaddexp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logcumsumexp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logcumsumexp_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logcumsumexp_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logcumsumexp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logcumsumexp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logcumsumexp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logdet_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logdet_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logdet_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logdet_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_and_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_not_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_or_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logical_xor_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logit_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logspace_tensor_overload_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_logsumexp_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_long_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lt_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_solve_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_solve_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_solve_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_solve_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_unpack_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_unpack_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_unpack_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_lu_unpack_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mH_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mT_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amax_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amax_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amax_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amax_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amax_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_amin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmax_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmax_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmax_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmax_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmax_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_argmin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumprod_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_cumsum_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_fill_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_log_softmax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_log_softmax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_log_softmax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_log_softmax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logaddexp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logaddexp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logaddexp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logaddexp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logsumexp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logsumexp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logsumexp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logsumexp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logsumexp_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logsumexp_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logsumexp_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logsumexp_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_logsumexp_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_mean_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_median_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_median_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_median_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_median_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_normalize_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_normalize_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_normalize_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_normalize_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_normalize_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_normalize_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_prod_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_scatter_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_select_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_softmax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_softmax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_softmax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_softmax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_softmin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_softmin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_softmin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_softmin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_std_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_sum_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_masked_var_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matmul_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matmul_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matmul_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matmul_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matmul_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matmul_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matrix_exp_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matrix_exp_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matrix_exp_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matrix_exp_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matrix_exp_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_matrix_exp_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_binary_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_pool2d_with_indices_backward_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_pool2d_with_indices_backward_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_pool2d_with_indices_backward_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_no_dim_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_max_reduction_with_dim_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_maximum_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mean_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mean_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mean_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mean_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mean_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mean_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_median_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_median_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_median_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_median_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_median_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_median_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_median_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_median_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_median_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_list_of_tensors_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_meshgrid_variadic_tensors_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_binary_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_no_dim_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_min_reduction_with_dim_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_minimum_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mode_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_movedim_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_msort_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_msort_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_msort_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_msort_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_msort_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_msort_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_msort_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_msort_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_msort_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mul_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_multinomial_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_multinomial_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_multinomial_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_multinomial_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mv_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mv_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mv_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mv_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mv_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mv_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nan_to_num_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmean_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmean_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmean_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmean_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmean_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmean_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmean_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmedian_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmedian_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmedian_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmedian_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmedian_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmedian_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmedian_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmedian_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanmedian_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanquantile_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nanquantile_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nansum_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_narrow_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_batch_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_batch_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_batch_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_batch_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_dropout_backward_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_dropout_backward_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_dropout_backward_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_dropout_backward_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_layer_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_layer_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_layer_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_native_layer_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ne_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_neg_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_empty_strided_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_full_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_ones_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_new_zeros_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nextafter_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nextafter_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nextafter_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool3d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool3d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_alpha_dropout_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_alpha_dropout_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_alpha_dropout_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_alpha_dropout_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool1d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool1d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool1d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool1d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool3d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_avg_pool3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_batch_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_batch_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_batch_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_batch_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_batch_norm_without_cudnn_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_bilinear_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_bilinear_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_bilinear_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_bilinear_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_binary_cross_entropy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_binary_cross_entropy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_binary_cross_entropy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_binary_cross_entropy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_binary_cross_entropy_with_logits_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_celu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_celu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_celu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_celu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_channel_shuffle_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv1d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv1d_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv1d_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv1d_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv1d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv1d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv1d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv2d_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv2d_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv2d_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv3d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv3d_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv3d_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv3d_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose1d_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose1d_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose1d_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose1d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose1d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose1d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose2d_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose2d_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose2d_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose3d_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose3d_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose3d_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_conv_transpose3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_similarity_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_similarity_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_similarity_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cosine_similarity_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cross_entropy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cross_entropy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cross_entropy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_cross_entropy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_ctc_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_ctc_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout3d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_dropout_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_elu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_elu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_elu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_elu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_embedding_bag_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_embedding_bag_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_embedding_bag_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_embedding_bag_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_embedding_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_embedding_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_embedding_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_embedding_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_with_train_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_feature_alpha_dropout_without_train_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_fractional_max_pool2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_fractional_max_pool2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_fractional_max_pool2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_fractional_max_pool2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_fractional_max_pool3d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_fractional_max_pool3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_fractional_max_pool3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_fractional_max_pool3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_gaussian_nll_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_gaussian_nll_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_gaussian_nll_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_gelu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_gelu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_gelu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_gelu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_glu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_glu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_glu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_glu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_grid_sample_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_grid_sample_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_grid_sample_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_grid_sample_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_group_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_group_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_group_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_group_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardshrink_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardshrink_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardshrink_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardshrink_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardsigmoid_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardsigmoid_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardsigmoid_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardsigmoid_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardswish_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardswish_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardswish_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardswish_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardtanh_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardtanh_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardtanh_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardtanh_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardtanh_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardtanh_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardtanh_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hardtanh_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hinge_embedding_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hinge_embedding_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_hinge_embedding_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_huber_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_huber_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_huber_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_huber_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_instance_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_instance_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_instance_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_instance_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_area_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_area_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_area_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_area_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_bicubic_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_bicubic_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_bicubic_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_bicubic_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_bilinear_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_bilinear_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_bilinear_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_bilinear_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_linear_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_linear_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_linear_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_linear_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest-exact_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_nearest_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_trilinear_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_trilinear_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_interpolate_trilinear_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_kl_div_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_kl_div_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_kl_div_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_kl_div_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_l1_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_l1_loss_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_l1_loss_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_l1_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_l1_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_l1_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_layer_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_layer_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_layer_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_layer_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_leaky_relu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_leaky_relu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_leaky_relu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_leaky_relu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_linear_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_linear_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_linear_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_linear_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_linear_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_linear_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_local_response_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_local_response_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_local_response_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_local_response_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_logsigmoid_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_logsigmoid_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_logsigmoid_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_logsigmoid_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_margin_ranking_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_margin_ranking_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_margin_ranking_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_margin_ranking_loss_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_margin_ranking_loss_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_margin_ranking_loss_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_margin_ranking_loss_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_margin_ranking_loss_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool1d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool1d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool1d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool1d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool2d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool3d_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_pool3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool1d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool1d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool1d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool1d_grad_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool1d_grad_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool1d_grad_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool2d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool2d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool2d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool2d_grad_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool2d_grad_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool2d_grad_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool3d_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool3d_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool3d_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool3d_grad_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool3d_grad_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_max_unpool3d_grad_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_mish_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_mish_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_mish_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_mish_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_mse_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_mse_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_mse_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_mse_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multi_head_attention_forward_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multi_head_attention_forward_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multi_head_attention_forward_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multi_head_attention_forward_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multi_margin_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multi_margin_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multi_margin_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multilabel_margin_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multilabel_margin_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multilabel_margin_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_nll_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_nll_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_nll_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_nll_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_normalize_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_normalize_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_normalize_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_normalize_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_normalize_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_normalize_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_one_hot_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_circular_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_constant_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_reflect_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pad_replicate_negative_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pairwise_distance_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pdist_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pdist_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_shuffle_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_pixel_unshuffle_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_poisson_nll_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_poisson_nll_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_poisson_nll_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_poisson_nll_loss_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_poisson_nll_loss_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_poisson_nll_loss_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_poisson_nll_loss_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_poisson_nll_loss_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_prelu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_prelu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_prelu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_prelu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu6_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu6_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu6_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu6_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu6_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu6_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu6_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu6_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu6_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_relu_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rms_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rms_norm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rms_norm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rms_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rms_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rms_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rrelu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rrelu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rrelu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_rrelu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_selu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_selu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_selu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_selu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_silu_complex_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_silu_complex_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_silu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_silu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_silu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_silu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_smooth_l1_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_smooth_l1_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_smooth_l1_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_soft_margin_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_soft_margin_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_soft_margin_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_soft_margin_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softmin_with_dtype_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softplus_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softplus_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softplus_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softplus_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softshrink_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softshrink_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softshrink_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softshrink_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_softsign_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_tanhshrink_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_threshold_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_threshold_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_threshold_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_threshold_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_threshold_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_threshold_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_threshold_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_threshold_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_threshold_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_loss_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_triplet_margin_with_distance_loss_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_unfold_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_unfold_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_unfold_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_unfold_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_unfold_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_unfold_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_upsample_bilinear_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_upsample_bilinear_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_upsample_bilinear_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_upsample_nearest_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_upsample_nearest_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_upsample_nearest_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_upsample_nearest_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nn_functional_upsample_nearest_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_nonzero_static_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_fro_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_fro_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_fro_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_fro_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_fro_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_fro_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_inf_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_inf_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_inf_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_inf_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_inf_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_inf_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_nuc_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_nuc_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_nuc_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_norm_nuc_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_in_place_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_in_place_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_in_place_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_in_place_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_in_place_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_in_place_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_number_mean_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_number_mean_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_number_mean_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_normal_number_mean_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ones_like_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ormqr_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ormqr_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ormqr_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ormqr_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_outer_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pca_lowrank_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pca_lowrank_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pca_lowrank_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pca_lowrank_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_permute_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pinverse_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pinverse_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pinverse_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pinverse_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polar_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polar_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_0_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_1_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_2_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_3_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_polygamma_polygamma_n_4_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_positive_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_pow_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_prod_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_put_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_qr_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_qr_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_qr_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_qr_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_quantile_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_quantile_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rad2deg_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rand_like_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rand_like_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rand_like_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rand_like_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rand_like_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rand_like_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rand_like_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_like_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_like_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_like_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_like_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_like_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_like_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_like_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_like_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randint_like_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_like_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_like_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_like_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_like_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_like_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_like_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_randn_like_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_ravel_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_real_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reciprocal_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_remainder_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_remainder_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_remainder_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_remainder_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_remainder_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_remainder_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_remainder_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_remainder_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_remainder_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_renorm_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_renorm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_renorm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_renorm_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_renorm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_renorm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_repeat_interleave_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_as_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_reshape_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize__cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resize_as__cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_conj_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_resolve_neg_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_roll_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rot90_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_0_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_0_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_3_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_3_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_3_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_3_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_neg_3_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_neg_3_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_neg_3_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_round_decimals_neg_3_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsqrt_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_rsub_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scalar_tensor_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_add_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amax_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amax_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amax_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amax_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amax_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_amin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_mean_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_mean_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_mean_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_mean_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_mean_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_mean_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_mean_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_mean_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_mean_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_prod_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_prod_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_prod_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_prod_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_prod_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_prod_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_prod_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_prod_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_prod_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_scatter_reduce_sum_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_searchsorted_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_searchsorted_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_searchsorted_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_searchsorted_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_searchsorted_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_searchsorted_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_searchsorted_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_searchsorted_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_searchsorted_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_select_scatter_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sgn_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_short_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sigmoid_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sign_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_bartlett_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_bartlett_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_blackman_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_blackman_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_cosine_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_cosine_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_exponential_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_exponential_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_gaussian_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_gaussian_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_general_cosine_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_general_cosine_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_general_hamming_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_general_hamming_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_hamming_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_hamming_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_hann_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_hann_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_kaiser_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_kaiser_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_nuttall_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signal_windows_nuttall_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_signbit_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sin_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinc_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sinh_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_slice_scatter_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_softmax_with_dtype_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sort_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sort_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sort_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sort_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sort_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sort_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sort_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sort_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sort_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sparse_mm_reduce_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sparse_mm_reduce_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sparse_mm_reduce_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sparse_mm_reduce_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sparse_sampled_addmm_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sparse_sampled_addmm_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sparse_sampled_addmm_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sparse_sampled_addmm_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_airy_ai_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_airy_ai_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_airy_ai_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_airy_ai_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_airy_ai_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_airy_ai_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_airy_ai_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_airy_ai_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j0_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j0_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j0_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j0_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j0_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j0_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j1_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j1_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j1_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j1_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j1_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j1_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j1_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_j1_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y0_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y0_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y0_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y0_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y0_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y0_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y1_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y1_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y1_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y1_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y1_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y1_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y1_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_bessel_y1_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_t_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_t_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_t_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_t_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_t_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_t_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_t_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_t_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_u_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_u_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_u_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_u_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_u_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_u_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_u_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_u_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_v_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_v_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_v_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_v_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_v_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_v_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_v_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_v_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_w_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_w_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_w_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_w_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_w_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_w_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_w_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_chebyshev_polynomial_w_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_entr_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_erfcx_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_erfcx_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_erfcx_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_erfcx_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_erfcx_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_erfcx_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_erfcx_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_erfcx_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_h_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_h_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_h_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_h_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_h_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_h_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_h_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_h_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_he_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_he_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_he_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_he_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_he_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_he_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_he_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_hermite_polynomial_he_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i0e_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1e_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1e_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1e_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1e_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1e_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1e_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1e_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_i1e_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_laguerre_polynomial_l_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_laguerre_polynomial_l_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_laguerre_polynomial_l_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_laguerre_polynomial_l_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_laguerre_polynomial_l_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_laguerre_polynomial_l_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_laguerre_polynomial_l_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_laguerre_polynomial_l_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_legendre_polynomial_p_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_legendre_polynomial_p_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_legendre_polynomial_p_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_legendre_polynomial_p_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_legendre_polynomial_p_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_legendre_polynomial_p_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_legendre_polynomial_p_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_legendre_polynomial_p_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_log_ndtr_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_log_ndtr_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_log_ndtr_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_log_ndtr_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_log_ndtr_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_log_ndtr_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_log_ndtr_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_log_ndtr_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i0_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i0_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i0_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i0_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i0_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i0_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i1_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i1_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i1_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i1_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i1_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i1_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i1_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_i1_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k0_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k0_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k0_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k0_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k0_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k0_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k1_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k1_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k1_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k1_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k1_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k1_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k1_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_modified_bessel_k1_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtr_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtri_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtri_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtri_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtri_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtri_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtri_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtri_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_ndtri_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k0_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k0_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k0_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k0_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k0_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k0_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k1_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k1_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k1_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k1_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k1_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k1_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k1_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_scaled_modified_bessel_k1_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_spherical_bessel_j0_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_spherical_bessel_j0_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_spherical_bessel_j0_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_spherical_bessel_j0_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_spherical_bessel_j0_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_spherical_bessel_j0_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_spherical_bessel_j0_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_spherical_bessel_j0_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_xlog1py_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_zeta_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_zeta_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_zeta_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_zeta_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_zeta_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_zeta_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_zeta_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_special_zeta_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_list_args_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_split_with_sizes_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sqrt_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_square_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_squeeze_multiple_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stack_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_unbiased_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_unbiased_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_unbiased_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_unbiased_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_unbiased_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_mean_unbiased_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_unbiased_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_unbiased_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_unbiased_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_unbiased_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_unbiased_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_std_unbiased_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stft_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stft_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stft_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_stft_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sub_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_sum_to_size_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_svd_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_svd_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_svd_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_svd_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_svd_lowrank_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_svd_lowrank_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_svd_lowrank_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_svd_lowrank_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_t_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_along_dim_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_take_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tan_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tanh_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensor_split_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensordot_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensordot_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensordot_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensordot_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensordot_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tensordot_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tile_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_to_sparse_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_topk_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_topk_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_topk_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_topk_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_topk_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_topk_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_topk_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_topk_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_topk_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch__scaled_mm_cuda_float8_e4m3fn, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__efficient_attention_forward_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__flash_attention_forward_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trace_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_transpose_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapezoid_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trapz_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triangular_solve_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triangular_solve_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triangular_solve_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triangular_solve_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_indices_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_tril_indices_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_indices_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_triu_indices_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_true_divide_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trunc_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trunc_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trunc_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trunc_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trunc_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trunc_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trunc_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trunc_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_trunc_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unbind_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unflatten_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unfold_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_uniform_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_uniform_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_uniform_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_uniform_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_uniform_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_uniform_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_consecutive_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_consecutive_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_consecutive_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_consecutive_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_consecutive_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_consecutive_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_consecutive_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_consecutive_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_consecutive_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_uint16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_uint32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_uint64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unique_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unravel_index_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unravel_index_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unravel_index_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unravel_index_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unravel_index_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_chunk_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsafe_split_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_unsqueeze_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_unbiased_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_unbiased_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_unbiased_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_unbiased_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_unbiased_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_mean_unbiased_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_unbiased_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_unbiased_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_unbiased_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_unbiased_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_unbiased_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_var_unbiased_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vdot_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vdot_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vdot_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vdot_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vdot_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vdot_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_complex_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_complex_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_complex_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_real_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_as_real_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_copy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_view_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vsplit_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_vstack_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_where_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_xlogy_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zero__cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_bfloat16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_bool, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_complex128, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_complex32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_complex64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_float16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_float32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_float64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_int16, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_int32, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_int64, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_int8, test/test_utils.py::TestDeviceUtilsCUDA::test_device_mode_ops_zeros_like_cuda_uint8, test/test_utils.py::TestDeviceUtilsCUDA::test_get_default_device_cuda, test/test_utils.py::TestDeviceUtilsCUDA::test_get_default_device_more_cuda, test/test_utils.py::TestDeviceUtilsCUDA::test_nn_module_cuda, test/test_utils.py::TestDeviceUtilsCUDA::test_set_default_device_cuda, test/test_utils.py::TestCppExtensionUtils::test_cc_compiler_is_ok, test/test_utils.py::TestCppExtensionUtils::test_cpp_compiler_is_ok, test/test_utils.py::TestTraceback::test_basic, test/test_utils.py::TestTraceback::test_captured_traceback, test/test_utils.py::TestTraceback::test_captured_traceback_format_all, test/test_utils.py::TestTraceback::test_captured_traceback_format_all_cached, test/test_utils.py::TestTraceback::test_format_traceback_short 2024-08-20T22:26:16.6070929Z 2024-08-20T22:26:16.6071435Z Running test_nn 1/1 ... [2024-08-20 22:26:16.116128] 2024-08-20T22:26:16.6071949Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:26:16.6073460Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_nn.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:26:16.116499] 2024-08-20T22:29:24.5814213Z 2024-08-20T22:29:24.5815394Z test_nn 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_nn_1.1_ebef799667f24a9c_.log 2024-08-20T22:29:24.7307536Z Running 2333 items in this shard: test/test_nn.py::TestNN::test_AdaptiveLogSoftmax, test/test_nn.py::TestNN::test_AdaptiveLogSoftmax_cuda_fp32, test/test_nn.py::TestNN::test_AdaptiveLogSoftmax_cuda_tf32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_BCELoss_no_reduce, test/test_nn.py::TestNN::test_BCELoss_no_reduce_cuda, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_scalar, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_legacy_enum, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_legacy_enum_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_scalar, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_CELU_no_batch_dim, test/test_nn.py::TestNN::test_CELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_CTCLoss_critical_target_len, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cpu, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cuda, test/test_nn.py::TestNN::test_CTCLoss_long_targets, test/test_nn.py::TestNN::test_CTCLoss_typechecks, test/test_nn.py::TestNN::test_CTCLoss_zero_infinity, test/test_nn.py::TestNN::test_CTCLoss_zero_lengths, test/test_nn.py::TestNN::test_Conv1d, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_dilated, test/test_nn.py::TestNN::test_Conv1d_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_groups, test/test_nn.py::TestNN::test_Conv1d_groups_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_groups_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad1, test/test_nn.py::TestNN::test_Conv1d_pad1_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad1_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad1size1, test/test_nn.py::TestNN::test_Conv1d_pad1size1_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad1size1_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad2, test/test_nn.py::TestNN::test_Conv1d_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad2size1, test/test_nn.py::TestNN::test_Conv1d_pad2size1_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad2size1_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad_same, test/test_nn.py::TestNN::test_Conv1d_pad_same2, test/test_nn.py::TestNN::test_Conv1d_pad_same2_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad_same2_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad_same_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad_same_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_pad_valid, test/test_nn.py::TestNN::test_Conv1d_pad_valid_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_pad_valid_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_stride, test/test_nn.py::TestNN::test_Conv1d_stride_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_stride_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_zero_batch, test/test_nn.py::TestNN::test_Conv1d_zero_batch_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_zero_batch_cuda_tf32, test/test_nn.py::TestNN::test_Conv1d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_zeros_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv1d_zeros_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_depthwise, test/test_nn.py::TestNN::test_Conv2d_depthwise_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_depthwise_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_dilated, test/test_nn.py::TestNN::test_Conv2d_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_groups, test/test_nn.py::TestNN::test_Conv2d_groups_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_groups_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_groups_thnn, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_groups_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_groups_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_groups_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_no_bias, test/test_nn.py::TestNN::test_Conv2d_no_bias_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_pad_same, test/test_nn.py::TestNN::test_Conv2d_pad_same_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_pad_same_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_pad_valid, test/test_nn.py::TestNN::test_Conv2d_pad_valid_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_pad_valid_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_padding, test/test_nn.py::TestNN::test_Conv2d_padding_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_padding_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_reflect_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_reflect_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_reflect_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_strided, test/test_nn.py::TestNN::test_Conv2d_strided_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_strided_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_zero_batch, test/test_nn.py::TestNN::test_Conv2d_zero_batch_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_zero_batch_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_dilated, test/test_nn.py::TestNN::test_Conv3d_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_dilated_strided, test/test_nn.py::TestNN::test_Conv3d_dilated_strided_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_dilated_strided_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_groups, test/test_nn.py::TestNN::test_Conv3d_groups_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_groups_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_no_bias, test/test_nn.py::TestNN::test_Conv3d_no_bias_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_pad_same, test/test_nn.py::TestNN::test_Conv3d_pad_same_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_pad_same_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_pad_valid, test/test_nn.py::TestNN::test_Conv3d_pad_valid_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_pad_valid_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_stride, test/test_nn.py::TestNN::test_Conv3d_stride_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_stride_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_stride_padding, test/test_nn.py::TestNN::test_Conv3d_stride_padding_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_stride_padding_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_zero_batch, test/test_nn.py::TestNN::test_Conv3d_zero_batch_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_zero_batch_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2_cuda_fp32, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose1d, test/test_nn.py::TestNN::test_ConvTranspose1d_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose1d_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose1d_groups, test/test_nn.py::TestNN::test_ConvTranspose1d_groups_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose1d_groups_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d, test/test_nn.py::TestNN::test_ConvTranspose2d_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d_groups, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose3d, test/test_nn.py::TestNN::test_ConvTranspose3d_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose3d_cuda_tf32, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated_cuda_fp32, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated_cuda_tf32, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_CrossMapLRN2d, test/test_nn.py::TestNN::test_CrossMapLRN2d_cuda, test/test_nn.py::TestNN::test_ELU_no_batch_dim, test/test_nn.py::TestNN::test_ELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Embedding, test/test_nn.py::TestNN::test_EmbeddingBag_discontiguous, test/test_nn.py::TestNN::test_EmbeddingBag_discontiguous_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_max, test/test_nn.py::TestNN::test_EmbeddingBag_max_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_mean, test/test_nn.py::TestNN::test_EmbeddingBag_mean_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_mean_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_mean_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sparse, test/test_nn.py::TestNN::test_EmbeddingBag_sparse_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum, test/test_nn.py::TestNN::test_EmbeddingBag_sum_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx_cuda, test/test_nn.py::TestNN::test_Embedding_cuda, test/test_nn.py::TestNN::test_Embedding_discontiguous, test/test_nn.py::TestNN::test_Embedding_discontiguous_cuda, test/test_nn.py::TestNN::test_Embedding_sparse, test/test_nn.py::TestNN::test_Embedding_sparse_cuda, test/test_nn.py::TestNN::test_Flatten, test/test_nn.py::TestNN::test_Flatten_cuda, test/test_nn.py::TestNN::test_Flatten_no_batch_dim, test/test_nn.py::TestNN::test_Flatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Fold, test/test_nn.py::TestNN::test_Fold_cuda, test/test_nn.py::TestNN::test_Fold_int_input, test/test_nn.py::TestNN::test_Fold_int_input_cuda, test/test_nn.py::TestNN::test_Fold_no_batch_dim_input, test/test_nn.py::TestNN::test_Fold_no_batch_dim_input_cuda, test/test_nn.py::TestNN::test_Fold_no_batch_dim_int_input, test/test_nn.py::TestNN::test_Fold_no_batch_dim_int_input_cuda, test/test_nn.py::TestNN::test_GELU_no_batch_dim, test/test_nn.py::TestNN::test_GELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_GLU_no_batch_dim, test/test_nn.py::TestNN::test_GLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardshrink_no_batch_dim, test/test_nn.py::TestNN::test_Hardshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardswish_no_batch_dim, test/test_nn.py::TestNN::test_Hardswish_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_HuberLoss_delta, test/test_nn.py::TestNN::test_HuberLoss_delta_cuda, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_KLDivLoss_batch_mean, test/test_nn.py::TestNN::test_KLDivLoss_batch_mean_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_log_target_cuda, test/test_nn.py::TestNN::test_KLDivLoss_with_log_target_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_with_log_target_no_reduce_cuda, test/test_nn.py::TestNN::test_KLDivLoss_with_target_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_with_target_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_L1Loss_no_reduce, test/test_nn.py::TestNN::test_L1Loss_no_reduce_complex, test/test_nn.py::TestNN::test_L1Loss_no_reduce_complex_cuda, test/test_nn.py::TestNN::test_L1Loss_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_reduce_scalar, test/test_nn.py::TestNN::test_L1Loss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_LSTM_cell, test/test_nn.py::TestNN::test_LSTM_cell_forward_hidden_size, test/test_nn.py::TestNN::test_LSTM_cell_forward_input_size, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_cuda, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval_cuda, test/test_nn.py::TestNN::test_LeakyReLU_no_batch_dim, test/test_nn.py::TestNN::test_LeakyReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Linear, test/test_nn.py::TestNN::test_Linear_cuda_fp32, test/test_nn.py::TestNN::test_Linear_cuda_tf32, test/test_nn.py::TestNN::test_Linear_no_batch_dim, test/test_nn.py::TestNN::test_Linear_no_batch_dim_cuda_fp32, test/test_nn.py::TestNN::test_Linear_no_batch_dim_cuda_tf32, test/test_nn.py::TestNN::test_Linear_no_bias, test/test_nn.py::TestNN::test_Linear_no_bias_cuda_fp32, test/test_nn.py::TestNN::test_Linear_no_bias_cuda_tf32, test/test_nn.py::TestNN::test_LogSigmoid_no_batch_dim, test/test_nn.py::TestNN::test_LogSigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_MSELoss_no_reduce, test/test_nn.py::TestNN::test_MSELoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_MaxUnpool1d_net, test/test_nn.py::TestNN::test_MaxUnpool1d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool1d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool1d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MaxUnpool2d_net, test/test_nn.py::TestNN::test_MaxUnpool2d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool2d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool2d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MaxUnpool3d_net, test/test_nn.py::TestNN::test_MaxUnpool3d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool3d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool3d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Mish_no_batch_dim, test/test_nn.py::TestNN::test_Mish_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ModuleDict, test/test_nn.py::TestNN::test_ModuleList, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_1d_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_index_neg, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_index_neg_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_weights_no_reduce, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_1d_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_margin_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_weights_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_NLLLoss_no_reduce, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_neg, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_neg_cuda, test/test_nn.py::TestNN::test_PReLU_backward_requires_grad_false, test/test_nn.py::TestNN::test_PReLU_no_batch_dim, test/test_nn.py::TestNN::test_PReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_PairwiseDistance, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_with_non_default_args, test/test_nn.py::TestNN::test_PairwiseDistance_with_non_default_args_cuda, test/test_nn.py::TestNN::test_ParameterDict, test/test_nn.py::TestNN::test_ParameterDict_replication, test/test_nn.py::TestNN::test_ParameterList, test/test_nn.py::TestNN::test_ParameterList_meta, test/test_nn.py::TestNN::test_ParameterList_replication, test/test_nn.py::TestNN::test_PixelShuffle, test/test_nn.py::TestNN::test_PixelShuffle_cuda, test/test_nn.py::TestNN::test_PixelUnshuffle, test/test_nn.py::TestNN::test_PixelUnshuffle_cuda, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_reduce, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_RNN_cell, test/test_nn.py::TestNN::test_RNN_cell_forward_zero_hidden_size, test/test_nn.py::TestNN::test_RNN_cell_no_broadcasting, test/test_nn.py::TestNN::test_RNN_change_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_no_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_with_dropout, test/test_nn.py::TestNN::test_RNN_cudnn_weight_norm, test/test_nn.py::TestNN::test_RNN_dropout, test/test_nn.py::TestNN::test_RNN_dropout_state, test/test_nn.py::TestNN::test_RNN_input_size_zero, test/test_nn.py::TestNN::test_RNN_nonlinearity, test/test_nn.py::TestNN::test_RNN_nonlinearity_passed_as_arg, test/test_nn.py::TestNN::test_RReLU, test/test_nn.py::TestNN::test_RReLU_cuda, test/test_nn.py::TestNN::test_RReLU_no_batch_dim, test/test_nn.py::TestNN::test_RReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_RReLU_with_up_down, test/test_nn.py::TestNN::test_RReLU_with_up_down_cuda, test/test_nn.py::TestNN::test_RReLU_with_up_down_scalar, test/test_nn.py::TestNN::test_RReLU_with_up_down_scalar_cuda, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ReLU_no_batch_dim, test/test_nn.py::TestNN::test_ReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d, test/test_nn.py::TestNN::test_ReplicationPad3d_complex, test/test_nn.py::TestNN::test_ReplicationPad3d_complex_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d_no_batch_dim, test/test_nn.py::TestNN::test_ReplicationPad3d_no_batch_dim_cuda, test/test_nn.py::TestNN::test_SELU_no_batch_dim, test/test_nn.py::TestNN::test_SELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Sequential_add, test/test_nn.py::TestNN::test_Sequential_append, test/test_nn.py::TestNN::test_Sequential_delitem, test/test_nn.py::TestNN::test_Sequential_extend, test/test_nn.py::TestNN::test_Sequential_getitem, test/test_nn.py::TestNN::test_Sequential_iadd, test/test_nn.py::TestNN::test_Sequential_imul, test/test_nn.py::TestNN::test_Sequential_insert, test/test_nn.py::TestNN::test_Sequential_insert_fail_case, test/test_nn.py::TestNN::test_Sequential_mul, test/test_nn.py::TestNN::test_Sequential_pop, test/test_nn.py::TestNN::test_Sequential_rmul, test/test_nn.py::TestNN::test_Sequential_setitem, test/test_nn.py::TestNN::test_Sequential_setitem_named, test/test_nn.py::TestNN::test_SiLU_no_batch_dim, test/test_nn.py::TestNN::test_SiLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Sigmoid_no_batch_dim, test/test_nn.py::TestNN::test_Sigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_beta, test/test_nn.py::TestNN::test_SmoothL1Loss_beta_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_scalar, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_zero_beta, test/test_nn.py::TestNN::test_SmoothL1Loss_zero_beta_cuda, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_SoftMarginLoss_no_reduce, test/test_nn.py::TestNN::test_SoftMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_Softplus_no_batch_dim, test/test_nn.py::TestNN::test_Softplus_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Softshrink_no_batch_dim, test/test_nn.py::TestNN::test_Softshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Softsign_no_batch_dim, test/test_nn.py::TestNN::test_Softsign_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Tanh_no_batch_dim, test/test_nn.py::TestNN::test_Tanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Tanhshrink_no_batch_dim, test/test_nn.py::TestNN::test_Tanhshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Threshold_no_batch_dim, test/test_nn.py::TestNN::test_Threshold_no_batch_dim_cuda, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation_cuda_fp32, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation_cuda_tf32, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation_cuda_fp32, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation_cuda_tf32, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation_cuda_fp32, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation_cuda_tf32, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation_cuda_fp32, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation_cuda_tf32, test/test_nn.py::TestNN::test_Transformer_cell, test/test_nn.py::TestNN::test_Transformer_multilayer_coder, test/test_nn.py::TestNN::test_Transformer_multilayer_coder_cuda_fp32, test/test_nn.py::TestNN::test_Transformer_multilayer_coder_cuda_tf32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_fp32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_tf32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_fp32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_tf32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_fp32, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_tf32, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Unfold, test/test_nn.py::TestNN::test_Unfold_cuda, test/test_nn.py::TestNN::test_Unfold_int_input, test/test_nn.py::TestNN::test_Unfold_int_input_cuda, test/test_nn.py::TestNN::test_adaptive_log_softmax, test/test_nn.py::TestNN::test_add_module, test/test_nn.py::TestNN::test_add_module_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_affine_grid, test/test_nn.py::TestNN::test_affine_grid_3d, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cpu_nd_2, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cpu_nd_3, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cuda_nd_2, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cuda_nd_3, test/test_nn.py::TestNN::test_affine_grid_error_checking, test/test_nn.py::TestNN::test_assignment, test/test_nn.py::TestNN::test_batch_norm_update_stats, test/test_nn.py::TestNN::test_batchnorm_buffer_update_when_stats_are_not_tracked, test/test_nn.py::TestNN::test_batchnorm_cudnn_half, test/test_nn.py::TestNN::test_batchnorm_cudnn_nhwc, test/test_nn.py::TestNN::test_batchnorm_load_state_dict, test/test_nn.py::TestNN::test_batchnorm_nhwc_cpu, test/test_nn.py::TestNN::test_batchnorm_nhwc_cuda, test/test_nn.py::TestNN::test_batchnorm_non_contig_cpu_BatchNorm2d, test/test_nn.py::TestNN::test_batchnorm_non_contig_cpu_SyncBatchNorm, test/test_nn.py::TestNN::test_batchnorm_nonaffine_cuda_half_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_bias_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_less_than_one_value_per_channel, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_mean_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_or_running_mean_have_forward_grad, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_weight_is_not_same_size_as_input, test/test_nn.py::TestNN::test_bce_loss_always_nonnegative, test/test_nn.py::TestNN::test_bce_loss_broadcasts_weights, test/test_nn.py::TestNN::test_bce_loss_input_range, test/test_nn.py::TestNN::test_bce_loss_size_mismatch, test/test_nn.py::TestNN::test_bce_with_logits_broadcasts_pos_weights, test/test_nn.py::TestNN::test_bce_with_logits_broadcasts_weights, test/test_nn.py::TestNN::test_bce_with_logits_gives_same_result_as_sigmoid_and_bce_loss, test/test_nn.py::TestNN::test_bce_with_logits_gives_same_result_as_sigmoid_and_bce_loss_large_tensors_with_grad, test/test_nn.py::TestNN::test_bce_with_logits_has_correct_forward_grad, test/test_nn.py::TestNN::test_bce_with_logits_has_correct_grad_at_zero, test/test_nn.py::TestNN::test_bce_with_logits_ones_in_pos_weights_are_the_same_as_none, test/test_nn.py::TestNN::test_bce_with_logits_raises_if_target_and_input_are_different_size, test/test_nn.py::TestNN::test_bce_with_logits_stability, test/test_nn.py::TestNN::test_bce_with_logits_with_pos_weight_has_correct_grad_at_zero, test/test_nn.py::TestNN::test_bilinear, test/test_nn.py::TestNN::test_bilinear_broadcasting, test/test_nn.py::TestNN::test_bilinear_no_bias, test/test_nn.py::TestNN::test_bilinear_non_contiguous, test/test_nn.py::TestNN::test_broadcast_double_backwards_gpu, test/test_nn.py::TestNN::test_broadcast_no_grad, test/test_nn.py::TestNN::test_broadcast_not_requiring_grad, test/test_nn.py::TestNN::test_buffer_bad_module_subclass, test/test_nn.py::TestNN::test_buffer_not_persistent, test/test_nn.py::TestNN::test_buffer_not_persistent_assign, test/test_nn.py::TestNN::test_buffer_not_persistent_del, test/test_nn.py::TestNN::test_buffer_not_persistent_load, test/test_nn.py::TestNN::test_buffer_not_persistent_overwrite, test/test_nn.py::TestNN::test_buffers_and_named_buffers, test/test_nn.py::TestNN::test_call_supports_python_dict_output, test/test_nn.py::TestNN::test_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_children, test/test_nn.py::TestNN::test_container_copy, test/test_nn.py::TestNN::test_convert_sync_batchnorm, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_diff_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_nonexpandable_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_invalid_shape, test/test_nn.py::TestNN::test_cosine_embedding_loss_margin_no_reduce, test/test_nn.py::TestNN::test_cosine_embedding_loss_no_reduce, test/test_nn.py::TestNN::test_cosine_embedding_loss_with_diff_type, test/test_nn.py::TestNN::test_cosine_similarity, test/test_nn.py::TestNN::test_cross_entropy_loss, test/test_nn.py::TestNN::test_cross_entropy_loss_precision, test/test_nn.py::TestNN::test_cross_entropy_loss_zero_div, test/test_nn.py::TestNN::test_cudnn_forward_exception, test/test_nn.py::TestNN::test_cudnn_rnn_dropout_states_device, test/test_nn.py::TestNN::test_cudnn_weight_format, test/test_nn.py::TestNN::test_cudnn_weight_tying, test/test_nn.py::TestNN::test_dir, test/test_nn.py::TestNN::test_dir_digit, test/test_nn.py::TestNN::test_elu_inplace_gradgrad, test/test_nn.py::TestNN::test_elu_inplace_on_view, test/test_nn.py::TestNN::test_error_RNN_seq_len_zero, test/test_nn.py::TestNN::test_extra_state, test/test_nn.py::TestNN::test_extra_state_missing_get_extra_state, test/test_nn.py::TestNN::test_extra_state_missing_set_extra_state, test/test_nn.py::TestNN::test_extra_state_non_dict, test/test_nn.py::TestNN::test_fb_fc_packed, test/test_nn.py::TestNN::test_flatten, test/test_nn.py::TestNN::test_fold_invalid_arg, test/test_nn.py::TestNN::test_fractional_max_pool2d_invalid_output_ratio, test/test_nn.py::TestNN::test_gaussian_nll_loss_args, test/test_nn.py::TestNN::test_gaussian_nll_loss_broadcasting, test/test_nn.py::TestNN::test_get_buffer, test/test_nn.py::TestNN::test_get_buffer_from_submodules, test/test_nn.py::TestNN::test_getattr_with_property, test/test_nn.py::TestNN::test_grid_sample, test/test_nn.py::TestNN::test_grid_sample_3d, test/test_nn.py::TestNN::test_grid_sample_error_checking, test/test_nn.py::TestNN::test_grid_sample_nearest_neighbor_rounding_mode_consistency, test/test_nn.py::TestNN::test_hardtanh_backward, test/test_nn.py::TestNN::test_hardtanh_inplace_gradgrad, test/test_nn.py::TestNN::test_huber_loss_invalid_delta, test/test_nn.py::TestNN::test_inplace_thnn, test/test_nn.py::TestNN::test_interpolate, test/test_nn.py::TestNN::test_interpolate_bicubic_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_shared_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_shared_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_buffer_overflow, test/test_nn.py::TestNN::test_interpolate_illegal_memory_access, test/test_nn.py::TestNN::test_interpolate_linear_1d, test/test_nn.py::TestNN::test_interpolate_linear_1d_align_corners, test/test_nn.py::TestNN::test_interpolate_linear_1d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_linear_1d_cuda, test/test_nn.py::TestNN::test_interpolate_linear_1d_zero_dim, test/test_nn.py::TestNN::test_interpolate_linear_1d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_align_corners, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_cuda, test/test_nn.py::TestNN::test_interpolate_linear_tuple_1d, test/test_nn.py::TestNN::test_interpolate_linear_tuple_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_1d, test/test_nn.py::TestNN::test_interpolate_nearest_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_1d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_1d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d, test/test_nn.py::TestNN::test_interpolate_nearest_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d_launch_configs, test/test_nn.py::TestNN::test_interpolate_nearest_2d_launch_configs_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_3d, test/test_nn.py::TestNN::test_interpolate_nearest_3d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_3d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_3d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_1d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_2d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_3d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_3d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_1d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_2d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_3d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_zero_dim, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_align_corners, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_align_corners, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_cuda, test/test_nn.py::TestNN::test_interpolate_undefined_behavior_casting, test/test_nn.py::TestNN::test_kl_div_log_softmax_target, test/test_nn.py::TestNN::test_kl_div_with_diff_type, test/test_nn.py::TestNN::test_kl_div_with_diff_type_log_target, test/test_nn.py::TestNN::test_l1_loss_correct, test/test_nn.py::TestNN::test_layer_norm_eps, test/test_nn.py::TestNN::test_layer_norm_grads_with_create_graph_flag, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightStrided, test/test_nn.py::TestNN::test_linear_broadcasting, test/test_nn.py::TestNN::test_linear_raise_on_scalar_input, test/test_nn.py::TestNN::test_log_softmax_dim0, test/test_nn.py::TestNN::test_log_softmax_dim0_cuda, test/test_nn.py::TestNN::test_log_softmax_dim3, test/test_nn.py::TestNN::test_log_softmax_dim3_cuda, test/test_nn.py::TestNN::test_log_softmax_lastdim, test/test_nn.py::TestNN::test_log_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_log_softmax_scalar, test/test_nn.py::TestNN::test_log_softmax_scalar_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial, test/test_nn.py::TestNN::test_log_softmax_spatial_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial_special, test/test_nn.py::TestNN::test_log_softmax_spatial_special_cuda, test/test_nn.py::TestNN::test_loss_equal_input_target_shape, test/test_nn.py::TestNN::test_margin_ranking_loss_margin_no_reduce, test/test_nn.py::TestNN::test_margin_ranking_loss_no_reduce, test/test_nn.py::TestNN::test_max_pool1d_invalid_output_size, test/test_nn.py::TestNN::test_module_apply_inplace_op, test/test_nn.py::TestNN::test_module_backcompat, test/test_nn.py::TestNN::test_module_super_init, test/test_nn.py::TestNN::test_module_to_argparse, test/test_nn.py::TestNN::test_modules, test/test_nn.py::TestNN::test_mse_loss_size_warning, test/test_nn.py::TestNN::test_multimarginloss_1d_input_0d_target_no_reduce, test/test_nn.py::TestNN::test_multimarginloss_1d_input_0d_target_no_reduce_cuda, test/test_nn.py::TestNN::test_named_children, test/test_nn.py::TestNN::test_named_modules, test/test_nn.py::TestNN::test_named_parameters_remove_duplicate, test/test_nn.py::TestNN::test_native_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_nested_tensor_from_mask, test/test_nn.py::TestNN::test_nested_tensor_from_mask_error, test/test_nn.py::TestNN::test_no_grad, test/test_nn.py::TestNN::test_non_leaf_parameters, test/test_nn.py::TestNN::test_normalize, test/test_nn.py::TestNN::test_overwrite_module_params_on_conversion, test/test_nn.py::TestNN::test_pack_sequence_batch_sizes_throw, test/test_nn.py::TestNN::test_pad_scalar_error, test/test_nn.py::TestNN::test_padding_list, test/test_nn.py::TestNN::test_pairwise_distance, test/test_nn.py::TestNN::test_parameter_assignment, test/test_nn.py::TestNN::test_parameterlistdict_pickle, test/test_nn.py::TestNN::test_parameterlistdict_setting_attributes, test/test_nn.py::TestNN::test_parameters_and_named_parameters, test/test_nn.py::TestNN::test_parameters_to_vector, test/test_nn.py::TestNN::test_parse_to, test/test_nn.py::TestNN::test_partial_flat_weights, test/test_nn.py::TestNN::test_pdist, test/test_nn.py::TestNN::test_pdist_cpu_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_cuda_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_empty_col, test/test_nn.py::TestNN::test_pdist_empty_row, test/test_nn.py::TestNN::test_pdist_large, test/test_nn.py::TestNN::test_pdist_zeros, test/test_nn.py::TestNN::test_pickle_module_no_weights_only_warning, test/test_nn.py::TestNN::test_pixel_shuffle_nhwc_cpu, test/test_nn.py::TestNN::test_pixel_shuffle_unshuffle, test/test_nn.py::TestNN::test_pointwise_loss_broadcast, test/test_nn.py::TestNN::test_pointwise_loss_target_grad_none_reduction, test/test_nn.py::TestNN::test_projections_errors_on_gru_and_rnn, test/test_nn.py::TestNN::test_projections_lstm_args_check, test/test_nn.py::TestNN::test_projections_lstm_check_device, test/test_nn.py::TestNN::test_projections_lstm_initial_hidden_state, test/test_nn.py::TestNN::test_register_buffer_allows_overwriting_with_same_name, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_name_is_not_string, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_not_tensor, test/test_nn.py::TestNN::test_register_parameter_allows_overwriting_with_same_name, test/test_nn.py::TestNN::test_register_parameter_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_register_parameter_raises_error_if_name_is_not_string, test/test_nn.py::TestNN::test_relu_inplace_on_view, test/test_nn.py::TestNN::test_repr, test/test_nn.py::TestNN::test_requires_grad_, test/test_nn.py::TestNN::test_rnn_args_check, test/test_nn.py::TestNN::test_rnn_check_device, test/test_nn.py::TestNN::test_rnn_initial_hidden_state, test/test_nn.py::TestNN::test_rnn_weight_norm, test/test_nn.py::TestNN::test_set_submodule, test/test_nn.py::TestNN::test_share_memory, test/test_nn.py::TestNN::test_smoothl1loss_intergral_target, test/test_nn.py::TestNN::test_smoothl1loss_negative_beta_not_supported, test/test_nn.py::TestNN::test_softmax_functional_dim0, test/test_nn.py::TestNN::test_softmax_functional_dim0_cuda, test/test_nn.py::TestNN::test_softmax_functional_dim3, test/test_nn.py::TestNN::test_softmax_functional_dim3_cuda, test/test_nn.py::TestNN::test_softmax_functional_scalar, test/test_nn.py::TestNN::test_softmax_functional_scalar_cuda, test/test_nn.py::TestNN::test_softmax_lastdim, test/test_nn.py::TestNN::test_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_softmax_lastdim_dtype, test/test_nn.py::TestNN::test_softmax_lastdim_dtype_cuda, test/test_nn.py::TestNN::test_softmax_spatial, test/test_nn.py::TestNN::test_softmax_spatial_cuda, test/test_nn.py::TestNN::test_softmax_spatial_dtype, test/test_nn.py::TestNN::test_softmax_spatial_dtype_cuda, test/test_nn.py::TestNN::test_softmax_spatial_special, test/test_nn.py::TestNN::test_softmax_spatial_special_cuda, test/test_nn.py::TestNN::test_softmin, test/test_nn.py::TestNN::test_spectral_norm, test/test_nn.py::TestNN::test_spectral_norm_dim, test/test_nn.py::TestNN::test_spectral_norm_forward, test/test_nn.py::TestNN::test_spectral_norm_load_state_dict, test/test_nn.py::TestNN::test_spectral_norm_pickle, test/test_nn.py::TestNN::test_state_dict, test/test_nn.py::TestNN::test_swap_module_params_poisons_acc_grad, test/test_nn.py::TestNN::test_sync_batchnorm_accuracy_cuda, test/test_nn.py::TestNN::test_sync_batchnorm_backward_elemt, test/test_nn.py::TestNN::test_threshold_bfloat16_half, test/test_nn.py::TestNN::test_threshold_int, test/test_nn.py::TestNN::test_to, test/test_nn.py::TestNN::test_train_errors_for_invalid_mode, test/test_nn.py::TestNN::test_transformer_args_check, test/test_nn.py::TestNN::test_transformer_layer_args_check, test/test_nn.py::TestNN::test_transformerdecoder, test/test_nn.py::TestNN::test_transformerdecoderlayer, test/test_nn.py::TestNN::test_transformerdecoderlayer_gelu, test/test_nn.py::TestNN::test_triplet_margin_loss, test/test_nn.py::TestNN::test_triplet_margin_loss_no_reduce, test/test_nn.py::TestNN::test_triplet_margin_loss_swap, test/test_nn.py::TestNN::test_triplet_margin_loss_swap_no_reduce, test/test_nn.py::TestNN::test_type, test/test_nn.py::TestNN::test_unflatten, test/test_nn.py::TestNN::test_unflatten_invalid_arg, test/test_nn.py::TestNN::test_unfold_invalid_arg, test/test_nn.py::TestNN::test_upsamplingBilinear2d_spatial_invariance, test/test_nn.py::TestNN::test_upsamplingLinear1d, test/test_nn.py::TestNN::test_upsamplingLinear1d_spatial_invariance, test/test_nn.py::TestNN::test_upsamplingTrilinear3d_spatial_invariance, test/test_nn.py::TestNN::test_upsampling_bfloat16, test/test_nn.py::TestNN::test_upsampling_not_recompute_scale_factor, test/test_nn.py::TestNN::test_upsampling_small_scale, test/test_nn.py::TestNN::test_vector_to_parameters, test/test_nn.py::TestNN::test_weight_norm, test/test_nn.py::TestNN::test_weight_norm_pickle, test/test_nn.py::TestNN::test_zero_grad, test/test_nn.py::TestFusionEval::test_fuse_module_eval_numerics, test/test_nn.py::TestConstantPadNd::test_constant_pad_nd, test/test_nn.py::TestConstantPadNd::test_preserves_memory_format, test/test_nn.py::TestAddRelu::test_add_relu, test/test_nn.py::TestAddRelu::test_add_relu_broadcasting, test/test_nn.py::TestFunctionalPickle::test_pickle_softsign, test/test_nn.py::TestFusionUtils::test_fuse_conv_bn_requires_grad, test/test_nn.py::TestFusionUtils::test_fuse_linear_bn_requires_grad, test/test_nn.py::TestUtils::test_consume_prefix_in_state_dict_if_present, test/test_nn.py::TestNNDeviceTypeCUDA::test_BatchNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Bilinear_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_cudnn_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_empty_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_mean_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_mean_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_none_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_none_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_sum_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_sum_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GRU_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_memory_format_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_numeric_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_raises_error_if_one_value_per_group_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm1d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm2d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm3d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_differentiable_backward_using_oneDNN_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_differentiable_backward_using_oneDNN_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_LayerNorm_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LayerNorm_numeric_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LocalResponseNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_warnings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad2d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad3d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_empty_cuda_complex64, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad1d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad2d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad3d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad_empty_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoderLayer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoder_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerEncoderLayer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerEncoder_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Transformer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Unfold_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_adaptiveavg_pool1d_shmem_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate45_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate90_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotateRandom_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_3d_rotateRandom_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_avg_pool_large_tensor2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_avg_pool_large_tensor_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_update_stats_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_channel_shuffle_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_error_if_nonfinite_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_0_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_1_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_4_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_inf_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_0_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_1_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_4_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_inf_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_consistent_index_target_and_probs_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_errors_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_weight_ignore_indices_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_with_probs_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_index_target_unit_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_one_hot_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_all_reductions_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_none_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_none_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_sum_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_sum_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_unit_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_tensor_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_device_mask_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_elu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_elu_inplace_with_neg_alpha_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_fold_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_glu_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_bfloat16_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_half_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_2d_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_2d_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_3d_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_3d_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_nan_inf_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_nan_inf_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardsigmoid_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_for_single_spatial_element_during_training_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_less_than_one_value_per_channel_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_invalid_reduction_strings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_layernorm_half_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_layernorm_weight_bias_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_with_neg_slope_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_with_zero_slope_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_linear_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_logsigmoid_out_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_lstmcell_backward_only_one_output_grad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_TxT_layout_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_devices_parity_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_forward_with_nans_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_mask_types_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_transformer_layout_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_mish_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_non_recursive_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_all_ignored_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_byte_target_matches_long_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_invalid_target_dim_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_invalid_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_mismatched_batch_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_out_of_bounds_ignore_index_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_total_weight_is_zero_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_scalars_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_scalars_reductions_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nonlinearity_propagate_nan_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_one_hot_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_overwrite_module_params_on_conversion_cpu_device_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_pad_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_pad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_prelu_backward_32bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_replicatepad_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_fused_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_fused_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_save_lstm_compatibility_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_silu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_skip_init_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smooth_l1_loss_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smooth_l1_loss_vs_huber_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smoothl1loss_backward_zero_beta_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_backward_64bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_forward_64bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_results_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_results_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softplus_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softplus_low_threshold_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_negative_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_threshold_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_to_complex_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_fast_path_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_gelu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_gelu_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_default_parity_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_True_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_True_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_False_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_False_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_True_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_True_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_correctness_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBilinear2d_aa_correctness_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBilinear2d_aa_correctness_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_fail_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_rocm_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format1_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format0_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format1_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_correctness_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_correctness_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_rescale_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_False_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_False_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsampling_64bit_indexing_channels_last_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingnearest2d_backward_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_warp_softmax_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_warp_softmax_64bit_indexing_cuda_float32 2024-08-20T22:29:24.8568338Z 2024-08-20T22:29:24.8568822Z Running inductor/test_torchinductor_opinfo 3/13 ... [2024-08-20 22:29:24.586526] 2024-08-20T22:29:24.8569450Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:29:24.8571139Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_opinfo.py', '-m', 'serial', '--shard-id=3', '--num-shards=13', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:29:24.586934] 2024-08-20T22:29:32.3668727Z 2024-08-20T22:29:32.3670656Z inductor/test_torchinductor_opinfo 3/13 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_opinfo_3.13_754488b9a0974eb2_.log 2024-08-20T22:29:32.3672166Z Running 0 items in this shard: 2024-08-20T22:29:32.3672462Z 2024-08-20T22:29:32.3672936Z Running inductor/test_torchinductor_dynamic_shapes 4/6 ... [2024-08-20 22:29:32.366759] 2024-08-20T22:29:32.3673605Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:29:32.3675885Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_dynamic_shapes.py', '-m', 'serial', '--shard-id=4', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:29:32.367194] 2024-08-20T22:29:38.7435815Z 2024-08-20T22:29:38.7437499Z inductor/test_torchinductor_dynamic_shapes 4/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_dynamic_shapes_4.6_e9187bea32da0bdb_.log 2024-08-20T22:29:38.7438860Z Running 0 items in this shard: 2024-08-20T22:29:38.7439181Z 2024-08-20T22:29:38.7440333Z Running inductor/test_torchinductor_dynamic_shapes 5/6 ... [2024-08-20 22:29:38.743302] 2024-08-20T22:29:38.7441137Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:29:38.7443239Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_dynamic_shapes.py', '-m', 'serial', '--shard-id=5', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:29:38.743720] 2024-08-20T22:29:45.0196738Z 2024-08-20T22:29:45.0199048Z inductor/test_torchinductor_dynamic_shapes 5/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_dynamic_shapes_5.6_b0ce7145d0005719_.log 2024-08-20T22:29:45.0201221Z Running 0 items in this shard: 2024-08-20T22:29:45.0201522Z 2024-08-20T22:29:45.0202000Z Running inductor/test_torchinductor_dynamic_shapes 6/6 ... [2024-08-20 22:29:45.019308] 2024-08-20T22:29:45.0202689Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:29:45.0204754Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_dynamic_shapes.py', '-m', 'serial', '--shard-id=6', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:29:45.019700] 2024-08-20T22:29:51.3460988Z 2024-08-20T22:29:51.3462877Z inductor/test_torchinductor_dynamic_shapes 6/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_dynamic_shapes_6.6_44afbbef75406b9b_.log 2024-08-20T22:29:51.3464304Z Running 0 items in this shard: 2024-08-20T22:29:51.3464568Z 2024-08-20T22:29:51.3464943Z Running inductor/test_mmdecomp 1/1 ... [2024-08-20 22:29:51.346100] 2024-08-20T22:29:51.3465507Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:29:51.3469346Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_mmdecomp.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:29:51.346529] 2024-08-20T22:29:54.7179318Z 2024-08-20T22:29:54.7180982Z inductor/test_mmdecomp 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_mmdecomp_1.1_a4a2a61d68ecb080_.log 2024-08-20T22:29:54.7182010Z Running 0 items in this shard: 2024-08-20T22:29:54.7182272Z 2024-08-20T22:29:54.7182903Z Running dynamo/test_interop 1/1 ... [2024-08-20 22:29:54.717925] 2024-08-20T22:29:54.7183468Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:29:54.7187655Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_interop.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:29:54.718311] 2024-08-20T22:29:57.5396227Z 2024-08-20T22:29:57.5397912Z dynamo/test_interop 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_interop_1.1_cdd06ccb6ec3fce6_.log 2024-08-20T22:29:57.5398914Z Running 0 items in this shard: 2024-08-20T22:29:57.5399174Z 2024-08-20T22:29:57.5399792Z Running dynamo/test_logging 1/1 ... [2024-08-20 22:29:57.539651] 2024-08-20T22:29:57.5400423Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:29:57.5405062Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_logging.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:29:57.540063] 2024-08-20T22:30:00.4601774Z 2024-08-20T22:30:00.4603233Z dynamo/test_logging 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_logging_1.1_ec02132695f78500_.log 2024-08-20T22:30:00.4604249Z Running 0 items in this shard: 2024-08-20T22:30:00.4604505Z 2024-08-20T22:30:00.4605478Z Running dynamo/test_exc 1/1 ... [2024-08-20 22:30:00.460211] 2024-08-20T22:30:00.4606016Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:00.4609839Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_exc.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:00.460610] 2024-08-20T22:30:03.5314150Z 2024-08-20T22:30:03.5315767Z dynamo/test_exc 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_exc_1.1_645c1493b3b39705_.log 2024-08-20T22:30:03.5316807Z Running 0 items in this shard: 2024-08-20T22:30:03.5317110Z 2024-08-20T22:30:03.5317536Z Running dynamo/test_global 1/1 ... [2024-08-20 22:30:03.531258] 2024-08-20T22:30:03.5318151Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:03.5320281Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_global.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:03.531646] 2024-08-20T22:30:06.3011649Z 2024-08-20T22:30:06.3013608Z dynamo/test_global 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_global_1.1_e9904aa1427ada16_.log 2024-08-20T22:30:06.3014870Z Running 0 items in this shard: 2024-08-20T22:30:06.3015208Z 2024-08-20T22:30:06.3015658Z Running dynamo/test_unspec 1/1 ... [2024-08-20 22:30:06.301203] 2024-08-20T22:30:06.3016247Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:06.3019740Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_unspec.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:06.301572] 2024-08-20T22:30:09.1210795Z 2024-08-20T22:30:09.1212439Z dynamo/test_unspec 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_unspec_1.1_5da9c400816668ab_.log 2024-08-20T22:30:09.1213562Z Running 0 items in this shard: 2024-08-20T22:30:09.1213902Z 2024-08-20T22:30:09.1214735Z Running inductor/test_cudagraph_trees 1/1 ... [2024-08-20 22:30:09.121137] 2024-08-20T22:30:09.1215338Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:09.1219307Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cudagraph_trees.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:09.121511] 2024-08-20T22:30:14.8971130Z 2024-08-20T22:30:14.8972867Z inductor/test_cudagraph_trees 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cudagraph_trees_1.1_4ed39b64fe3fa186_.log 2024-08-20T22:30:14.8973995Z Running 0 items in this shard: 2024-08-20T22:30:14.8974251Z 2024-08-20T22:30:14.8975974Z Running dynamo/test_ctx_manager 1/1 ... [2024-08-20 22:30:14.897220] 2024-08-20T22:30:14.8976624Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:14.8980301Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_ctx_manager.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:14.897624] 2024-08-20T22:30:18.0190451Z 2024-08-20T22:30:18.0192599Z dynamo/test_ctx_manager 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_ctx_manager_1.1_39bb636c6cc72699_.log 2024-08-20T22:30:18.0193654Z Running 0 items in this shard: 2024-08-20T22:30:18.0193916Z 2024-08-20T22:30:18.0194263Z Running dynamo/test_subgraphs 1/1 ... [2024-08-20 22:30:18.018556] 2024-08-20T22:30:18.0194905Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:18.0196884Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_subgraphs.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:18.018910] 2024-08-20T22:30:20.7885467Z 2024-08-20T22:30:20.7887252Z dynamo/test_subgraphs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_subgraphs_1.1_6d75ef8d2c982096_.log 2024-08-20T22:30:20.7888281Z Running 0 items in this shard: 2024-08-20T22:30:20.7888604Z 2024-08-20T22:30:20.7889011Z Running inductor/test_pattern_matcher 1/1 ... [2024-08-20 22:30:20.788526] 2024-08-20T22:30:20.7889634Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:20.7893306Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_pattern_matcher.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:20.788914] 2024-08-20T22:30:24.8612368Z 2024-08-20T22:30:24.8614479Z inductor/test_pattern_matcher 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_pattern_matcher_1.1_42859534082d29c5_.log 2024-08-20T22:30:24.8615805Z Running 0 items in this shard: 2024-08-20T22:30:24.8616150Z 2024-08-20T22:30:24.8616562Z Running dynamo/test_autograd_function 1/1 ... [2024-08-20 22:30:24.861210] 2024-08-20T22:30:24.8617166Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:24.8620998Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_autograd_function.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:24.861651] 2024-08-20T22:30:27.8317230Z 2024-08-20T22:30:27.8319813Z dynamo/test_autograd_function 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_autograd_function_1.1_83c351875f80d4c7_.log 2024-08-20T22:30:27.8320971Z Running 0 items in this shard: 2024-08-20T22:30:27.8321227Z 2024-08-20T22:30:27.8323266Z Running dynamo/test_activation_checkpointing 1/1 ... [2024-08-20 22:30:27.831940] 2024-08-20T22:30:27.8324056Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:27.8328275Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_activation_checkpointing.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:27.832359] 2024-08-20T22:30:31.0028441Z 2024-08-20T22:30:31.0030296Z dynamo/test_activation_checkpointing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_activation_checkpointing_1.1_f01f200d89d7ffe6_.log 2024-08-20T22:30:31.0031793Z Running 0 items in this shard: 2024-08-20T22:30:31.0032128Z 2024-08-20T22:30:31.0033564Z Running inductor/test_inductor_freezing 1/1 ... [2024-08-20 22:30:31.003034] 2024-08-20T22:30:31.0034339Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:31.0039099Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_inductor_freezing.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:31.003479] 2024-08-20T22:30:37.1326691Z 2024-08-20T22:30:37.1328621Z inductor/test_inductor_freezing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_inductor_freezing_1.1_c7bfbc6ba6b8cf25_.log 2024-08-20T22:30:37.1329724Z Running 0 items in this shard: 2024-08-20T22:30:37.1329979Z 2024-08-20T22:30:37.1330471Z Running inductor/test_mkldnn_pattern_matcher 1/1 ... [2024-08-20 22:30:37.132542] 2024-08-20T22:30:37.1331280Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:37.1333612Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_mkldnn_pattern_matcher.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:37.132973] 2024-08-20T22:30:42.5584702Z 2024-08-20T22:30:42.5586537Z inductor/test_mkldnn_pattern_matcher 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_mkldnn_pattern_matcher_1.1_f0af5f6af3934195_.log 2024-08-20T22:30:42.5587834Z Running 0 items in this shard: 2024-08-20T22:30:42.5588165Z 2024-08-20T22:30:42.5588562Z Running inductor/test_aot_inductor 6/16 ... [2024-08-20 22:30:42.558465] 2024-08-20T22:30:42.5589145Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:42.5593537Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '-m', 'serial', '--shard-id=6', '--num-shards=16', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:42.558907] 2024-08-20T22:30:49.0361941Z 2024-08-20T22:30:49.0365896Z inductor/test_aot_inductor 6/16 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_6.16_29ce288076f824ea_.log 2024-08-20T22:30:49.0366997Z Running 0 items in this shard: 2024-08-20T22:30:49.0367262Z 2024-08-20T22:30:49.0367895Z Running inductor/test_aot_inductor 7/16 ... [2024-08-20 22:30:49.036111] 2024-08-20T22:30:49.0368521Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:49.0370186Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '-m', 'serial', '--shard-id=7', '--num-shards=16', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:49.036495] 2024-08-20T22:30:55.5142214Z 2024-08-20T22:30:55.5144238Z inductor/test_aot_inductor 7/16 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_7.16_cea4703a7db51c70_.log 2024-08-20T22:30:55.5145381Z Running 0 items in this shard: 2024-08-20T22:30:55.5145638Z 2024-08-20T22:30:55.5146058Z Running inductor/test_aot_inductor 14/16 ... [2024-08-20 22:30:55.513219] 2024-08-20T22:30:55.5146647Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:30:55.5148307Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '-m', 'serial', '--shard-id=14', '--num-shards=16', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:30:55.513611] 2024-08-20T22:31:01.9910820Z 2024-08-20T22:31:01.9912690Z inductor/test_aot_inductor 14/16 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_14.16_e938f28b87cc8c41_.log 2024-08-20T22:31:01.9913782Z Running 0 items in this shard: 2024-08-20T22:31:01.9914035Z 2024-08-20T22:31:01.9914413Z Running inductor/test_cpu_cpp_wrapper 1/1 ... [2024-08-20 22:31:01.990899] 2024-08-20T22:31:01.9915007Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:31:01.9916855Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cpu_cpp_wrapper.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:31:01.991307] 2024-08-20T22:31:08.4549576Z 2024-08-20T22:31:08.4551185Z inductor/test_cpu_cpp_wrapper 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cpu_cpp_wrapper_1.1_790233f39735ecf4_.log 2024-08-20T22:31:08.4552132Z 2024-08-20T22:31:08.4623260Z Running inductor/test_torchinductor_opinfo 3/13 ... [2024-08-20 22:31:08.461756] 2024-08-20T22:31:08.4624037Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:31:08.4624931Z Running inductor/test_torchinductor_dynamic_shapes 4/6 ... [2024-08-20 22:31:08.461895] 2024-08-20T22:31:08.4625611Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:31:08.4626382Z Running inductor/test_torchinductor_dynamic_shapes 5/6 ... [2024-08-20 22:31:08.462140] 2024-08-20T22:31:08.4627041Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:31:08.4629184Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_opinfo.py', '-m', 'not serial', '--shard-id=3', '--num-shards=13', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:31:08.462306] 2024-08-20T22:31:08.4632074Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_dynamic_shapes.py', '-m', 'not serial', '--shard-id=4', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:31:08.462340] 2024-08-20T22:31:08.4635248Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_dynamic_shapes.py', '-m', 'not serial', '--shard-id=5', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:31:08.462683] 2024-08-20T22:39:00.2555595Z 2024-08-20T22:39:00.2558487Z inductor/test_torchinductor_dynamic_shapes 4/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_dynamic_shapes_4.6_751d6449bb22a520_.log 2024-08-20T22:39:00.2705049Z Running 225 items in this shard: test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_abs_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_add_complex6_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_aoti_eager_support_out_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_argmax_argmin3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_baddbmm_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_batch_norm_2d_2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bfloat16_to_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bool_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_both_scalars_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_default_kwargs_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_empty_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_single_empty_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_unbacked_empty_1d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_constant_pad_3d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_constant_pad_nd_inplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv_functional_bn_fuse_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv_inference_heuristics_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_convolution2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cumprod_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_custom_scan_op_multi_input_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div6_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout_trivial_0_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtypeview_fusion_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_empty2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_expm1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fft_real_input_real_output_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fill1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_flip_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_float16_to_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fmod_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_full_boolean_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_generate_rand_fp8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_glu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_grid_sampler_2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_hardtanh_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_horizonal_fusion1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_propagation_remainder_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_failed_reinplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_fallback1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_fallback2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_indirect_load_broadcast_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_input_mutation1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_input_mutation3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_input_mutation5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_insignificant_strides_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_isinf2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_l1_loss_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_tensor_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_like_rands2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_like_rands3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linear1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linear2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_list_clearing_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_log_fp64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_long_tensor_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_masked_fill_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d6_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_new_empty_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_new_ones_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_no_mega_fusion_during_lowering_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_no_op_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pad_cast_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_airy_ai_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_chebyshev_polynomial_u_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_erf_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_expm1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_i1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_ndtr_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pow1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_rsqrt_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_rsqrt_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scalar_cpu_tensor_arg_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter6_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter_add3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scheduler_vertical_fusion1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_False_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_select_scatter_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_silu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_simplify_loops_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_cumprod_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_cumsum_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tensor1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_to_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_triu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unroll_small_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_var_mean_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_view_as_complex_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_view_detach_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_views1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_views3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_views4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_max_pool2d1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_complex3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_inplace_permuted_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_any_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_aoti_eager_dtype_device_layout_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_argmin_with_nan_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_as_strided_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d5_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d_backward3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_buffer_batch_norm_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_buffer_use_after_remove_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_builtins_round_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_builtins_round_float_ndigits_zero_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_empty_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_single_empty_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_compar_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_complex_fallback_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_computed_buffer_inlining_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_config_option_dont_assume_alignment_recompiles_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_consecutive_split_cumprod_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_consecutive_split_cumsum_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv2d_backward_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv3d_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cudnn_rnn_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cumsum_zero_dim_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_op_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dist_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dropout2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dropout_trivial_0_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtype_sympy_expr_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_empty2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_exp2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_expand_as_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_expand_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_expm1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fallback_mutable_op_list_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fill1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_float_index_expression_type_promotion_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fractional_max_pool2d3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_full_like_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_generate_rand_fp8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_glu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_hardswish_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_horizonal_fusion2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_remainder_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_fallback1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_indirect_load_broadcast_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inplace_activations_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_input_mutation2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_input_mutation4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_isinf_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_l1_loss_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_large_broadcast_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_large_pointwise_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_like_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linear_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_logsumexp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d_with_indices_backward5_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mixed_mm2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mul_index_expr_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multi_gpu_device_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multilayer_var_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_narrow_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nll_loss_forward_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_permute2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_bessel_j0_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_bessel_y1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_chebyshev_polynomial_t_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_chebyshev_polynomial_v_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_digamma_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfc_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfinv_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_modified_bessel_k0_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_round_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_xlog1py_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pow2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_rand_like_deterministic_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randint_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reduction3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reduction_config_limit_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_relu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_round_correctness_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scalar_cpu_tensor_arg_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_reduce1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_mutation2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter5_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_softmax_one_kernel_loop_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sqrt_dynamic_shapes_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_stack_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum_int_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tan_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_to_device_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_to_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_topk_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_transpose_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_transposed_propagates_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_upsample_cat_conv_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_upsample_nearest2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_upsample_nearest3d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_as_complex_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_view_on_aliased_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_xblock_divides_xnumel_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_zero_element_mutation_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_constant_fold_uniform_value_dynamic_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_full_recompiles_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_nobreak_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_unbacked_stride_nobreak_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_math_ops_op0_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_math_ops_op5_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_math_ops_op6_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_slice_index_changing_sign_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_unbacked_index_select_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_unwrap_storage_didnt_work_repro_cuda 2024-08-20T22:39:00.2820958Z 2024-08-20T22:39:03.3632802Z Running inductor/test_torchinductor_dynamic_shapes 6/6 ... [2024-08-20 22:39:03.362636] 2024-08-20T22:39:03.3633630Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:39:03.3635442Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_dynamic_shapes.py', '-m', 'not serial', '--shard-id=6', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:39:03.363010] 2024-08-20T22:40:14.6143547Z 2024-08-20T22:40:14.6145444Z inductor/test_torchinductor_dynamic_shapes 5/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_dynamic_shapes_5.6_d0a003dead724c4a_.log 2024-08-20T22:40:14.6250416Z Running 195 items in this shard: test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_add_complex3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_add_inplace_permuted_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_addmm_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_arange2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_arange5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_argmax_argmin1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_argmax_argmin_with_nan_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_argmax_min_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d_backward2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool3d_backward3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bitwise2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bmm2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_builtins_round_int_ndigits_pos_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_config_option_dont_assume_alignment_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_constant_pad_1d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_data_type_propogation_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dist_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div8_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div_precision_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_div_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dropout_deterministic_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_embedding_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_empty_strided_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_expanded_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fallback_mutable_op_list_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fmin_fmax_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fractional_max_pool2d2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fractional_max_pool2d4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_full_truncation_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_functionalize_rng_wrappers_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_gather1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_deterministic_fallback_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inplace_resize_as_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_lerp_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_like_rands_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linear_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linear_mixed_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_log_softmax_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_logcumsumexp_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_logcumsumexp_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_logsumexp_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d7_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mm_mixed_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mm_views_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multilayer_var_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mutations_loop_fusion_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_permute2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_philox_rand_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_bessel_j1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_chebyshev_polynomial_v_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_legendre_polynomial_p_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_log_ndtr_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_round_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_scaled_modified_bessel_k1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_shifted_chebyshev_polynomial_u_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_sinc_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_xlog1py_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pow_int_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reduction2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_repeat_interleave_2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_resize_as_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reuse_buffers_with_aliasing_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scaled_dot_product_efficient_attention_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter_add2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sgn_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sign_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_mutation1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_mutation2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_scatter_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_cumsum_low_prec_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_squeeze2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_squeeze_varargs_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum_keepdims_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tanh_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tensor2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tmp_not_defined_issue3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_topk_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_uint_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unbacked_floordiv_simplify_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unsqueeze_inplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_bilinear2d_a_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_vdd_clamp_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_vectorized_ops_masked_var_novec_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test__unsafe_masked_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_avg_pool2d1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_addmm_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_aoti_eager_with_persistent_cache_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_arange4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_argmin1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_argmax_argmin_with_duplicates_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d_backward4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bfloat16_to_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bitwise2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bitwise_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bmm1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_buffer_copied_in_graph_with_different_shapes_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_extern_kernel_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_of_loops_and_extern_kernel_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_unbacked_empty_1d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_clamp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_complex_memory_overlap_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_constant_pad_1d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_constant_pad_3d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_constant_pad_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_convolution2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_op_3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_scan_op_compiled_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_dtypeview_fusion_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_embedding_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_exp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fallback_mutable_op_no_mutated_tensors_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fallback_mutable_op_with_return_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fft_real_input_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_float32_to_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fractional_max_pool2d1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fuse_large_params_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_hardsigmoid_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_hardtanh_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_dynamic_shapes_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_flip_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_floordiv_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_deterministic_fallback_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_large_offset_pointwise_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_layer_norm_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linear2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_log_softmax_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_masked_scatter_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d7_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d_with_indices_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_min_max_reduction_nan_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mixed_mm3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mixed_mm_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_mm_views_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multi_gpu_recompile_on_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_multilayer_var_lowp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_new_empty_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_philox_rand_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_bessel_j1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_expit_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_expm1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_i1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_legendre_polynomial_p_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_ndtr_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_psi_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_shifted_chebyshev_polynomial_v_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_polar_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randn_like_empty_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_interleave_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_resize_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_rsqrt_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter6_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_add1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scheduler_vertical_fusion1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sgn_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sgn_extremal_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_single_elem_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sort_bool_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_cumprod_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_cumsum_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_squeeze2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_std_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_strided_inputs_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sum4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tmp_not_defined_issue2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_triu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_uint_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unsqueeze_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_vdd_clamp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views5_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_float_item_neginf_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_full_symbolic_value_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_to_inputs_kernel_nobreak_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_math_ops_op1_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_pad_dynamic_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_shape_as_constant_reciprocal_float_exp_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_sub_constant_folding_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_unbacked_cat_backwards_save_data_dependent_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_unbacked_matmul_cuda 2024-08-20T22:40:14.6352916Z 2024-08-20T22:40:17.7385382Z Running inductor/test_mmdecomp 1/1 ... [2024-08-20 22:40:17.737924] 2024-08-20T22:40:17.7386030Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:40:17.7387708Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_mmdecomp.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:40:17.738308] 2024-08-20T22:40:29.0782612Z 2024-08-20T22:40:29.0784874Z inductor/test_mmdecomp 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_mmdecomp_1.1_8091e9ab0f841e91_.log 2024-08-20T22:40:29.0801539Z Running 26 items in this shard: test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_bfloat16_bs_10_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_bfloat16_bs_1_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_bfloat16_bs_2_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_bfloat16_bs_4_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_float32_bs_10_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_float32_bs_1_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_float32_bs_2_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_batched_mm_float32_bs_4_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_bmm_batch2_last_dim_size_is_one_cuda, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_simple_mm_bfloat16_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_simple_mm_float32_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_bfloat16_bs_10_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_bfloat16_bs_1_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_bfloat16_bs_2_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_bfloat16_bs_4_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_float32_bs_10_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_float32_bs_1_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_float32_bs_2_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_float32_bs_4_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_int32_bs_10_cuda_int32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_int32_bs_1_cuda_int32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_int32_bs_2_cuda_int32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_batched_int32_bs_4_cuda_int32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_bfloat16_cuda_bfloat16, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_float32_cuda_float32, test/inductor/test_mmdecomp.py::TestDecompCUDA::test_some_int32_cuda_int32 2024-08-20T22:40:29.0817724Z 2024-08-20T22:40:32.2075420Z Running dynamo/test_interop 1/1 ... [2024-08-20 22:40:32.206921] 2024-08-20T22:40:32.2076040Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:40:32.2078012Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_interop.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:40:32.207345] 2024-08-20T22:40:35.9292513Z 2024-08-20T22:40:35.9293990Z dynamo/test_interop 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_interop_1.1_bfdf717c06dc6d1c_.log 2024-08-20T22:40:35.9296503Z Running 4 items in this shard: test/dynamo/test_interop.py::InteropTests::test_fx_fn, test/dynamo/test_interop.py::InteropTests::test_script_fn, test/dynamo/test_interop.py::InteropTests::test_trace_fn, test/dynamo/test_interop.py::InteropTests::test_vmap_in_graph 2024-08-20T22:40:35.9297827Z 2024-08-20T22:40:39.1093668Z Running dynamo/test_logging 1/1 ... [2024-08-20 22:40:39.108685] 2024-08-20T22:40:39.1094292Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:40:39.1096126Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_logging.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:40:39.109067] 2024-08-20T22:41:03.6762785Z 2024-08-20T22:41:03.6764717Z dynamo/test_logging 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_logging_1.1_d24298d44e58599e_.log 2024-08-20T22:41:03.6778903Z Running 41 items in this shard: test/dynamo/test_logging.py::LoggingTests::test_all, test/dynamo/test_logging.py::LoggingTests::test_aot, test/dynamo/test_logging.py::LoggingTests::test_aot_graphs, test/dynamo/test_logging.py::LoggingTests::test_aot_joint_graph, test/dynamo/test_logging.py::LoggingTests::test_bytecode, test/dynamo/test_logging.py::LoggingTests::test_cudagraph_static_inputs, test/dynamo/test_logging.py::LoggingTests::test_cudagraphs, test/dynamo/test_logging.py::LoggingTests::test_custom_format, test/dynamo/test_logging.py::LoggingTests::test_custom_format_exc, test/dynamo/test_logging.py::LoggingTests::test_ddp_graphs, test/dynamo/test_logging.py::LoggingTests::test_default_logging, test/dynamo/test_logging.py::LoggingTests::test_distributed_rank_logging, test/dynamo/test_logging.py::LoggingTests::test_dump_compile_times, test/dynamo/test_logging.py::LoggingTests::test_dynamo_debug, test/dynamo/test_logging.py::LoggingTests::test_dynamo_debug_default_off_artifacts, test/dynamo/test_logging.py::LoggingTests::test_dynamo_error, test/dynamo/test_logging.py::LoggingTests::test_dynamo_info, test/dynamo/test_logging.py::LoggingTests::test_fusion, test/dynamo/test_logging.py::LoggingTests::test_graph_breaks, test/dynamo/test_logging.py::LoggingTests::test_guards_recompiles, test/dynamo/test_logging.py::LoggingTests::test_inductor_debug, test/dynamo/test_logging.py::LoggingTests::test_inductor_error, test/dynamo/test_logging.py::LoggingTests::test_inductor_info, test/dynamo/test_logging.py::LoggingTests::test_invalid_artifact_flag, test/dynamo/test_logging.py::LoggingTests::test_kernel_code, test/dynamo/test_logging.py::LoggingTests::test_logs_out, test/dynamo/test_logging.py::LoggingTests::test_multiline_format, test/dynamo/test_logging.py::LoggingTests::test_open_registration, test/dynamo/test_logging.py::LoggingTests::test_open_registration_python_api, test/dynamo/test_logging.py::LoggingTests::test_open_registration_with_registered_parent, test/dynamo/test_logging.py::LoggingTests::test_output_code, test/dynamo/test_logging.py::LoggingTests::test_recompiles, test/dynamo/test_logging.py::LoggingTests::test_schedule, test/dynamo/test_logging.py::LoggingTests::test_trace_call, test/dynamo/test_logging.py::LoggingTests::test_trace_call_graph_break, test/dynamo/test_logging.py::LoggingTests::test_trace_call_inline_call, test/dynamo/test_logging.py::LoggingTests::test_trace_source_cond, test/dynamo/test_logging.py::LoggingTests::test_trace_source_funcname, test/dynamo/test_logging.py::LoggingTests::test_trace_source_if_stmt, test/dynamo/test_logging.py::LoggingTests::test_trace_source_nested, test/dynamo/test_logging.py::LoggingTests::test_trace_source_simple 2024-08-20T22:41:03.6791308Z 2024-08-20T22:41:06.8195354Z Running dynamo/test_exc 1/1 ... [2024-08-20 22:41:06.818953] 2024-08-20T22:41:06.8195936Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:41:06.8197778Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_exc.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:41:06.819370] 2024-08-20T22:41:10.5412029Z 2024-08-20T22:41:10.5413771Z dynamo/test_exc 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_exc_1.1_dd2fb399061bcfaa_.log 2024-08-20T22:41:10.5417289Z Running 9 items in this shard: test/dynamo/test_exc.py::ExcTests::test_backend_suppress_line, test/dynamo/test_exc.py::ExcTests::test_graph_break_log, test/dynamo/test_exc.py::ExcTests::test_internal_error_no_suppress, test/dynamo/test_exc.py::ExcTests::test_internal_error_suppress_errors, test/dynamo/test_exc.py::ExcTests::test_not_implemented_error, test/dynamo/test_exc.py::ExcTests::test_trigger_bisect_on_error, test/dynamo/test_exc.py::ExcTests::test_trigger_on_error, test/dynamo/test_exc.py::ExcTests::test_unsupported_error, test/dynamo/test_exc.py::ExcTests::test_unsupported_real_stack 2024-08-20T22:41:10.5420078Z 2024-08-20T22:41:13.7468733Z Running dynamo/test_global 1/1 ... [2024-08-20 22:41:13.746213] 2024-08-20T22:41:13.7469388Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:41:13.7471404Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_global.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:41:13.746663] 2024-08-20T22:41:21.2289025Z 2024-08-20T22:41:21.2291045Z dynamo/test_global 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_global_1.1_9d25f6270b81f203_.log 2024-08-20T22:41:21.2297287Z Running 12 items in this shard: test/dynamo/test_global.py::TestGlobals::test_store_global_1, test/dynamo/test_global.py::TestGlobals::test_store_global_2, test/dynamo/test_global.py::TestGlobals::test_store_global_cross_file, test/dynamo/test_global.py::TestGlobals::test_store_global_crossfile_inline, test/dynamo/test_global.py::TestGlobals::test_store_global_dict, test/dynamo/test_global.py::TestGlobals::test_store_global_dict_2, test/dynamo/test_global.py::TestGlobals::test_store_global_inline_1, test/dynamo/test_global.py::TestGlobals::test_store_global_inline_2, test/dynamo/test_global.py::TestGlobals::test_store_global_list, test/dynamo/test_global.py::TestGlobals::test_store_global_list_2, test/dynamo/test_global.py::TestGlobals::test_store_global_new, test/dynamo/test_global.py::TestGlobals::test_store_global_object 2024-08-20T22:41:21.2302505Z 2024-08-20T22:41:24.3861755Z Running dynamo/test_unspec 1/1 ... [2024-08-20 22:41:24.385628] 2024-08-20T22:41:24.3862422Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:41:24.3864305Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_unspec.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:41:24.386037] 2024-08-20T22:41:43.4410165Z 2024-08-20T22:41:43.4412003Z dynamo/test_unspec 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_unspec_1.1_93c118265b6937fb_.log 2024-08-20T22:41:43.4432199Z Running 43 items in this shard: test/dynamo/test_unspec.py::UnspecTests::test_argmin_coerces_symint_to_intlist_spec, test/dynamo/test_unspec.py::UnspecTests::test_bool_tensor_ctor, test/dynamo/test_unspec.py::UnspecTests::test_builtin_functions_on_cuda, test/dynamo/test_unspec.py::UnspecTests::test_builtin_getitem, test/dynamo/test_unspec.py::UnspecTests::test_builtin_max_min, test/dynamo/test_unspec.py::UnspecTests::test_compiled_random_calls_are_random, test/dynamo/test_unspec.py::UnspecTests::test_conv1d_symint_padding, test/dynamo/test_unspec.py::UnspecTests::test_data_dependent_evaluate_expr_graph_break, test/dynamo/test_unspec.py::UnspecTests::test_defaults, test/dynamo/test_unspec.py::UnspecTests::test_exponential, test/dynamo/test_unspec.py::UnspecTests::test_feed_random_values_into_graph_only, test/dynamo/test_unspec.py::UnspecTests::test_isinstance_symint, test/dynamo/test_unspec.py::UnspecTests::test_item_max, test/dynamo/test_unspec.py::UnspecTests::test_mark_01_dynamic, test/dynamo/test_unspec.py::UnspecTests::test_mark_static_inside, test/dynamo/test_unspec.py::UnspecTests::test_mark_unbacked, test/dynamo/test_unspec.py::UnspecTests::test_mark_unbacked_channels_last, test/dynamo/test_unspec.py::UnspecTests::test_mark_unbacked_hint_consistency, test/dynamo/test_unspec.py::UnspecTests::test_multiple_consecutive_random_calls_before_graph, test/dynamo/test_unspec.py::UnspecTests::test_no_recompilations, test/dynamo/test_unspec.py::UnspecTests::test_no_recompiles, test/dynamo/test_unspec.py::UnspecTests::test_no_recompiles_prod_backward, test/dynamo/test_unspec.py::UnspecTests::test_numpy_correctness, test/dynamo/test_unspec.py::UnspecTests::test_propagate_dynamic_dim, test/dynamo/test_unspec.py::UnspecTests::test_prune_torch_check, test/dynamo/test_unspec.py::UnspecTests::test_random_call_with_while_loop, test/dynamo/test_unspec.py::UnspecTests::test_random_object, test/dynamo/test_unspec.py::UnspecTests::test_random_object_methods, test/dynamo/test_unspec.py::UnspecTests::test_random_object_overriden_methods, test/dynamo/test_unspec.py::UnspecTests::test_random_values_with_graph_break, test/dynamo/test_unspec.py::UnspecTests::test_rshift_dynamic, test/dynamo/test_unspec.py::UnspecTests::test_shape_graph_break, test/dynamo/test_unspec.py::UnspecTests::test_specializing_numpy_float_in_control_flow, test/dynamo/test_unspec.py::UnspecTests::test_split_aot_autograd, test/dynamo/test_unspec.py::UnspecTests::test_sum_dimlist_spec, test/dynamo/test_unspec.py::UnspecTests::test_sym_int_conversion, test/dynamo/test_unspec.py::UnspecTests::test_symbol_guard_limit_before_specialize, test/dynamo/test_unspec.py::UnspecTests::test_symfloat_to_tensor, test/dynamo/test_unspec.py::UnspecTests::test_to_tensor, test/dynamo/test_unspec.py::UnspecTests::test_unspec_float_input, test/dynamo/test_unspec.py::UnspecTests::test_unspec_float_output, test/dynamo/test_unspec.py::UnspecTests::test_unspec_float_precision, test/dynamo/test_unspec.py::UnspecTests::test_use_and_specialize 2024-08-20T22:41:43.4451359Z 2024-08-20T22:41:46.7714014Z Running inductor/test_cudagraph_trees 1/1 ... [2024-08-20 22:41:46.770761] 2024-08-20T22:41:46.7714693Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:41:46.7716390Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cudagraph_trees.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:41:46.771182] 2024-08-20T22:43:55.1077117Z 2024-08-20T22:43:55.1079889Z inductor/test_cudagraph_trees 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cudagraph_trees_1.1_f60c058c5a614110_.log 2024-08-20T22:43:55.1141257Z Running 91 items in this shard: test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_accumulate_grad, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_accumulate_multiple_recordings, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_alias_of_parameter, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_aliased_output_checkpoint, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_aliased_static_parameter, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_aliased_storage_single_weakref, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_aliasing_static_ref, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_amp_cache_disabled, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_backward_gets_cached_cudagraphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_cache_hit_forward_miss_backward, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_cached_forward_backward, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_checkpoint_shared_output_storage_deallocation, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_checkpointing_resets_persistent_refs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_cleanup, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_compiled_autograd_static_input_params, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_constant_output, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_conv_benchmark, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_dynamic_backward, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_dynamic_warmup, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_empty_cpu_tensor, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_empty_storage, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_end_recording_early, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_error_on_dealloc_use, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_execution_into_recording, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_expanded_inputs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_fallback_to_eager_if_recompiling_too_many_times, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_fallback_to_eager_if_recompiling_too_many_times_due_to_cudagraph_managed_tensor, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_fallback_to_eager_if_recompiling_too_many_times_warn_only_once, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_forward_backward, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_forward_backward_not_called_backend_cudagraphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_forward_backward_not_called_backend_inductor, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_forward_generation, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_forward_with_skipped_cudagraphed_backward, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_frozen_fn, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_function_compiled_multiple_times, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_item, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_nonzero, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_nonzero_backend, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_incompatible_cudagraph_ops_nonzero_graph_breaks, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_live_outputs_multiple_graphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_manager_per_device, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mark_step, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multi_dispatch_child_node, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multi_dispatch_custom_module, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multi_dispatch_custom_module_buffer, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multi_dispatch_parent_node, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multi_dispatch_single_compile_builtin_module, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multi_dispatch_single_compile_builtin_module_buffers, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multi_dispatch_single_compile_param_inputs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multiple_devices_msg_backend_cudagraphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multiple_devices_msg_backend_inductor, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_multiple_insert_removal_caching, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_backend_cudagraphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_backend_inductor, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_only_once_backend_cudagraphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensor_warn_only_once_backend_inductor, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_backend_cudagraphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_backend_inductor, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_config_backend_cudagraphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_cudagraph_managed_tensors_config_backend_inductor, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_on_inp_backend_cudagraphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_on_inp_backend_inductor, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_mutation_reinplaced, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_not_fallback_to_eager_if_have_not_recompiling_too_many_times, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_output_alias, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_peristed_output_livenes, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_remove_hooks_on_cached_tensors, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_rerecord_if_static_input_address_changed, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_rng_non_trees, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_rng_trees, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_run_simple, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_separate_recordings, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_single_stream_use, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_skip_if_dynamic_shape_limit_reached1, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_skip_if_dynamic_shape_limit_reached2, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_skip_symbolic, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_sparsity, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_static_inputs_address_mutation_log, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_storage_access_error, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_tensor_constant_mutation, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_tensor_dies_between_checkpoint, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_tensor_no_longer_in_pool, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_unaligned_static_input_no_cudagraphs, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_unaligned_static_input_non_trees, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_unaligned_static_input_trees, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_unaligned_static_parameter, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_unstable_ptr, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_warmup_stream_sync, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_warn_on_pending_backward, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_warn_once_if_dynamic_shape_limit_reached, test/inductor/test_cudagraph_trees.py::CudaGraphTreeTests::test_workspace_allocation_error 2024-08-20T22:43:55.1197815Z 2024-08-20T22:43:58.3018950Z Running dynamo/test_ctx_manager 1/1 ... [2024-08-20 22:43:58.301274] 2024-08-20T22:43:58.3019582Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:43:58.3021523Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_ctx_manager.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:43:58.301708] 2024-08-20T22:44:01.0371277Z 2024-08-20T22:44:01.0373177Z inductor/test_torchinductor_opinfo 3/13 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_opinfo_3.13_8770fb90d6be8e5a_.log 2024-08-20T22:44:01.0523653Z Running 281 items in this shard: test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_T_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___getitem___cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___getitem___cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rmatmul___cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rpow___cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__segment_reduce_offsets_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__upsample_bilinear2d_aa_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_acosh_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_add_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addbmm_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addr_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_alias_copy_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_all_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_allclose_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_amax_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_angle_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_angle_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argmin_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argwhere_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_partial_views_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_scatter_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_asinh_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atan_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atanh_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_baddbmm_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bfloat16_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bfloat16_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bincount_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_xor_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_broadcast_to_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bucketize_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cdist_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ceil_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ceil_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ceil_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cfloat_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_char_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cholesky_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_chunk_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_max_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clone_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_conj_physical_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_constant_pad_nd_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_contiguous_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cosh_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_count_nonzero_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_count_nonzero_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummax_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummin_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_scatter_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_scatter_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diff_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_trunc_rounding_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_trunc_rounding_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_trunc_rounding_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_dot_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_einsum_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_permuted_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_permuted_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_equal_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erfinv_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_exp_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fftn_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfft2_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftn_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ihfftn_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft2_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flatten_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flip_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fliplr_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_power_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmod_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gather_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ge_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_geometric_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_geometric_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_geometric_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gradient_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gradient_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_grid_sampler_2d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gt_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_half_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_heaviside_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hsplit_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_i0_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_fill_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_fill_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_mean_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_mean_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_mean_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_prod_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isinf_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isinf_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isinf_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isposinf_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isreal_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_binary_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kron_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kthvalue_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kthvalue_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_le_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_le_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lgamma_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_det_singular_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_diagonal_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_lu_factor_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_lu_solve_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_slogdet_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_solve_triangular_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_svd_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vander_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vector_norm_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linspace_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log10_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logaddexp_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_or_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_xor_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_xor_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logsumexp_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mH_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amin_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_logsumexp_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_mean_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_scatter_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_std_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_reduction_with_dim_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_reduction_with_dim_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_reduction_with_dim_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_reduction_with_dim_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mode_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_msort_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nansum_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_narrow_copy_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_native_layer_norm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_ones_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_ones_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_ones_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_zeros_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_max_pool2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_max_pool2d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_max_pool3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_avg_pool1d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_avg_pool2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_binary_cross_entropy_with_logits_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_cosine_similarity_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_with_train_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_hardtanh_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_huber_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_kl_div_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_layer_norm_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_leaky_relu_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_margin_ranking_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_unpool2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_unpool2d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_multi_head_attention_forward_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_nll_loss_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_one_hot_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_replicate_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_prelu_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_scaled_dot_product_attention_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softmin_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softshrink_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_upsample_nearest_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_inf_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_inf_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_normal_in_place_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_pca_lowrank_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polar_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_0_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_4_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_4_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_pow_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_quantile_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rad2deg_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_randint_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_randn_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_repeat_interleave_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_as_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resize__cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resize__cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rot90_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_round_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_mean_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_sum_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_searchsorted_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_searchsorted_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_select_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sgn_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sign_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_blackman_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_kaiser_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signbit_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sin_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sin_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sinh_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_slice_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_softmax_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sparse_mm_reduce_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sparse_sampled_addmm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_airy_ai_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_y0_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_entr_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_erfcx_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_hermite_polynomial_he_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_laguerre_polynomial_l_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_legendre_polynomial_p_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_log_ndtr_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_log_ndtr_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_log_ndtr_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtri_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_polygamma_special_polygamma_n_0_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_v_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_v_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_xlog1py_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_with_sizes_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_with_sizes_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_split_with_sizes_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sqrt_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sqrt_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_stft_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sub_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_svd_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_t_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_t_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tanh_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tensordot_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_topk_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_topk_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trapezoid_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tril_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_triu_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_true_divide_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trunc_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trunc_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trunc_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unbind_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_chunk_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsqueeze_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_var_mean_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_var_unbiased_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_as_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vsplit_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vstack_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_where_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_xlogy_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_zero__cuda_float64 2024-08-20T22:44:01.0666776Z 2024-08-20T22:44:04.1389893Z Running dynamo/test_subgraphs 1/1 ... [2024-08-20 22:44:04.138313] 2024-08-20T22:44:04.1390587Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:44:04.1393234Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_subgraphs.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:44:04.138734] 2024-08-20T22:44:06.1859148Z 2024-08-20T22:44:06.1860735Z dynamo/test_ctx_manager 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_ctx_manager_1.1_75855fe6ab32fe1c_.log 2024-08-20T22:44:06.1882141Z Running 52 items in this shard: test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_arguments_binding, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_cpu, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_cpu_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_cpu_graph_break_2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_cpu_graph_break_inner_fn, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_decorator, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_device, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_float64, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_graph_break_method, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autocast_sdpa, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autograd_profiler, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_autograd_profiler_enabled, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_context_wrapping_grad_mode_decorator, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_context_wrapping_grad_mode_nested_function_decorator, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_context_wrapping_set_grad_enabled_nested_function, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_amp_autocast, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_device, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_across_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_created_outside_of_graph, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_method, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_method_create_stream_outside_of_compile, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_event_reconstruct, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_across_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_compared_with_constant, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_compared_with_stream, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_context_manager1, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_context_manager2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_cuda_stream_method, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_disable_saved_tensors_hooks, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_disable_saved_tensors_hooks_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_disable_saved_tensors_hooks_prev_disabled, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_disable_saved_tensors_hooks_prev_disabled_nested, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_generic_context_manager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_generic_context_manager_with_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_generic_ctx_manager_with_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_grad_mode_guard, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_graph_break_inlining_autocast, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_graph_break_inlining_grad, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_local, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_local_nullctx, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_local_nullctx2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_stack, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_inactive_context_graph_break_stack2, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_is_autocast_cpu_enabled, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_nested_generic_context_manager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_nested_generic_context_manager_with_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_nested_grad_mode_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_no_grad, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_return_context_manager, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_return_context_manager_with_graph_break, test/dynamo/test_ctx_manager.py::CtxManagerTests::test_torch_profiler 2024-08-20T22:44:06.1902165Z 2024-08-20T22:44:09.4074885Z Running inductor/test_pattern_matcher 1/1 ... [2024-08-20 22:44:09.406872] 2024-08-20T22:44:09.4075616Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:44:09.4077706Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_pattern_matcher.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:44:09.407276] 2024-08-20T22:44:09.6143608Z 2024-08-20T22:44:09.6145425Z dynamo/test_subgraphs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_subgraphs_1.1_a61c88364af4b0f0_.log 2024-08-20T22:44:09.6160176Z Running 44 items in this shard: test/dynamo/test_subgraphs.py::SubGraphTests::test_capi_call1, test/dynamo/test_subgraphs.py::SubGraphTests::test_capi_call2, test/dynamo/test_subgraphs.py::SubGraphTests::test_capi_call3, test/dynamo/test_subgraphs.py::SubGraphTests::test_control_flow1, test/dynamo/test_subgraphs.py::SubGraphTests::test_control_flow2, test/dynamo/test_subgraphs.py::SubGraphTests::test_control_flow3, test/dynamo/test_subgraphs.py::SubGraphTests::test_control_flow4, test/dynamo/test_subgraphs.py::SubGraphTests::test_control_flow5, test/dynamo/test_subgraphs.py::SubGraphTests::test_dynamic_duck_size, test/dynamo/test_subgraphs.py::SubGraphTests::test_dynamic_getitem, test/dynamo/test_subgraphs.py::SubGraphTests::test_dynamic_kwarg, test/dynamo/test_subgraphs.py::SubGraphTests::test_dynamic_order_dependence, test/dynamo/test_subgraphs.py::SubGraphTests::test_dynamic_zero_inference, test/dynamo/test_subgraphs.py::SubGraphTests::test_enumerate_not_break_graph, test/dynamo/test_subgraphs.py::SubGraphTests::test_extended_args, test/dynamo/test_subgraphs.py::SubGraphTests::test_graph_break_on_item, test/dynamo/test_subgraphs.py::SubGraphTests::test_indirect_unsupported1, test/dynamo/test_subgraphs.py::SubGraphTests::test_indirect_unsupported2, test/dynamo/test_subgraphs.py::SubGraphTests::test_indirect_unsupported3, test/dynamo/test_subgraphs.py::SubGraphTests::test_multigraph, test/dynamo/test_subgraphs.py::SubGraphTests::test_no_graph_break_on_item, test/dynamo/test_subgraphs.py::SubGraphTests::test_pop_after_resume, test/dynamo/test_subgraphs.py::SubGraphTests::test_restore_range, test/dynamo/test_subgraphs.py::SubGraphTests::test_restore_range_iter, test/dynamo/test_subgraphs.py::SubGraphTests::test_restore_state, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume1, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume2, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume3, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume4, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume5, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume_freevars, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume_paths_join, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume_tuple_iterator, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume_with_no_grad1, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume_with_no_grad2, test/dynamo/test_subgraphs.py::SubGraphTests::test_resume_with_no_grad3, test/dynamo/test_subgraphs.py::SubGraphTests::test_stack_state1, test/dynamo/test_subgraphs.py::SubGraphTests::test_stack_state2, test/dynamo/test_subgraphs.py::SubGraphTests::test_start1, test/dynamo/test_subgraphs.py::SubGraphTests::test_start2, test/dynamo/test_subgraphs.py::SubGraphTests::test_start3, test/dynamo/test_subgraphs.py::SubGraphTests::test_start4, test/dynamo/test_subgraphs.py::SubGraphTests::test_tuple_iterator_mutate, test/dynamo/test_subgraphs.py::SubGraphTests::test_tuple_iterator_return 2024-08-20T22:44:09.6174475Z 2024-08-20T22:44:12.9472755Z Running dynamo/test_autograd_function 1/1 ... [2024-08-20 22:44:12.946612] 2024-08-20T22:44:12.9473503Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:44:12.9475404Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_autograd_function.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:44:12.947034] 2024-08-20T22:44:21.1801500Z 2024-08-20T22:44:21.1803469Z dynamo/test_autograd_function 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_autograd_function_1.1_ad8e2fcd08863b7b_.log 2024-08-20T22:44:21.1817238Z Running 30 items in this shard: test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_allow_in_graph, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_amp_custom_fwd_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_autograd_function_equivalence, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_autograd_function_has_graph_break, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_backward_returns_none_for_tensor_input, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_classmethod, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_default_values, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_enum_arg, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_function_context_mark_and_save, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_function_context_save_and_mark, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_function_with_bound_free_variable, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_linear_setup_context, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_materialize_grad, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_multi_output, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_multiple_different_non_tensor_inputs, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_needs_input_grad, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_once_differentiable, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_print_in_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_repeated_save_for_backward_calls, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_save_for_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_smoke_from_test_autograd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_smuggle_symint_issue_111031, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_smuggle_tensor_and_complex_structures, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_stride_in_bwd, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_tensor_list_as_input, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_tensor_subclass_intermediary_input, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_triton_kernel_basic, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_triton_kernel_multiple_out, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_tuple_arg, test/dynamo/test_autograd_function.py::AutogradFunctionTests::test_user_defined_object_as_input 2024-08-20T22:44:21.1830031Z 2024-08-20T22:44:24.5295479Z Running dynamo/test_activation_checkpointing 1/1 ... [2024-08-20 22:44:24.528880] 2024-08-20T22:44:24.5296174Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:44:24.5297937Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_activation_checkpointing.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:44:24.529266] 2024-08-20T22:44:42.2266043Z 2024-08-20T22:44:42.2268002Z dynamo/test_activation_checkpointing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_activation_checkpointing_1.1_7e7764c45dd50549_.log 2024-08-20T22:44:42.2285269Z Running 29 items in this shard: test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_autocast_flash_attention, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_custom_rule, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_inplace_op, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_invalid_context, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_must_not_recompute_gemm, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_must_recompute, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_outplace_op, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_parametrization, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_partial_ctx_fn, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_random_op, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_compile_selective_checkpoint_tensor_subclass, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_distributed_utils_checkpoint_wrapper, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_dynamo_does_not_trace_getattr_as_top_frame, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_error_msg, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_fallback, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_kwargs, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_list_inputs, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_pattern_matcher, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_symints_location, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_decomps, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_dropout, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_function, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_function_via_global_checkpoint, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_function_with_kwargs, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_module, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_multiple_checkpoints, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_rand, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_recomputed_rand, test/dynamo/test_activation_checkpointing.py::ActivationCheckpointingViaTagsTests::test_tags_sequential_layers 2024-08-20T22:44:42.2301734Z 2024-08-20T22:44:45.4601249Z Running inductor/test_inductor_freezing 1/1 ... [2024-08-20 22:44:45.459523] 2024-08-20T22:44:45.4602257Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:44:45.4605092Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_inductor_freezing.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:44:45.459969] 2024-08-20T22:46:11.2457037Z 2024-08-20T22:46:11.2458766Z inductor/test_inductor_freezing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_inductor_freezing_1.1_8de405f949c5ce50_.log 2024-08-20T22:46:11.2478950Z Running 44 items in this shard: test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_aliased_param_return_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_autocast_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_bn_with_multi_bn_share_conv_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_functional_bn_with_multi_bn_share_conv_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_layout_convert_with_view_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_multiple_uses_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_weight_layout_convert_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_with_as_strided_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_cpp_wrapper_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_dont_change_dtype_folding_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_error_on_eager_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_bn_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_bn_with_module_sharing_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_functional_bn_with_module_sharing_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_mm_concat_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_mutation_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_param_deallocated_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_redundant_clone_for_layout_convert_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_rng_op_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_symint_not_folded_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_unequal_bias_horizontal_addmm_fusion_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_unfolded_bn_cpu, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_aliased_param_return_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_autocast_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_conv_bn_with_multi_bn_share_conv_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_conv_functional_bn_with_multi_bn_share_conv_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_conv_layout_convert_with_view_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_conv_multiple_uses_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_conv_weight_layout_convert_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_conv_with_as_strided_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_cpp_wrapper_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_dont_change_dtype_folding_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_error_on_eager_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_folded_conv_bn_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_folded_conv_bn_with_module_sharing_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_folded_conv_functional_bn_with_module_sharing_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_mm_concat_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_mutation_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_param_deallocated_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_redundant_clone_for_layout_convert_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_rng_op_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_symint_not_folded_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_unequal_bias_horizontal_addmm_fusion_cuda, test/inductor/test_inductor_freezing.py::FreezingCudaTests::test_unfolded_bn_cuda 2024-08-20T22:46:11.2497811Z 2024-08-20T22:46:14.3767200Z Running inductor/test_mkldnn_pattern_matcher 1/1 ... [2024-08-20 22:46:14.376185] 2024-08-20T22:46:14.3768190Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:46:14.3770762Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_mkldnn_pattern_matcher.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:46:14.376591] 2024-08-20T22:47:47.7244356Z 2024-08-20T22:47:47.7246608Z inductor/test_pattern_matcher 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_pattern_matcher_1.1_df0e9bce35c5ebaf_.log 2024-08-20T22:47:47.7264240Z Running 36 items in this shard: test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_addmm, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_addmm_broadcasting_bias, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_addmm_symbolic_scalar, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_cat_addmm, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_cat_mm, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_cat_slice_cat_cuda, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_cat_splitwithsizes, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_fused_int_mm_mul, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_fused_int_mm_mul_gating, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_match_equivalent_function_invocations1, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_match_equivalent_function_invocations2, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_match_equivalent_function_invocations3, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_match_with_mutation, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mixed_mm, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mixed_mm_bad_cases, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mixed_mm_cpu, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mixed_mm_epi_works, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mixed_mm_exhaustive_dtypes, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mixed_mm_gating, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mixed_mm_heuristic_no, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mixed_mm_heuristic_yes, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mm_plus_mm, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_mutation_op_matching, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_pointless_convert, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_pointless_cumsum, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_remove_pointless_clones, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_scaled_softmax, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_serialized_patterns_up_to_date, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_splitwithsizes_cat, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_stable_topological_sort, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_symint_pattern_matching, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_uint4x2_mixed_mm, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_uint4x2_mixed_mm_epi, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_uint4x2_mixed_mm_fail_to_match, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_uint4x2_mixed_mm_gating_works, test/inductor/test_pattern_matcher.py::TestPatternMatcher::test_unfuse_bias_addmm 2024-08-20T22:47:47.7279124Z 2024-08-20T22:47:50.8826995Z Running inductor/test_aot_inductor 6/16 ... [2024-08-20 22:47:50.882131] 2024-08-20T22:47:50.8827760Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:47:50.8829478Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '-m', 'not serial', '--shard-id=6', '--num-shards=16', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:47:50.882505] 2024-08-20T22:48:08.6171228Z 2024-08-20T22:48:08.6173107Z inductor/test_torchinductor_dynamic_shapes 6/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_dynamic_shapes_6.6_0a0b3453adb372fa_.log 2024-08-20T22:48:08.6295513Z Running 223 items in this shard: test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test__unsafe_masked_index_put_accumulate_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_adaptive_avg_pool2d2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_adding_tensor_offsets_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_aoti_eager_support_str_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_arange3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_arange4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_as_strided_scatter_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d6_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d7_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d_backward3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool2d_backward4_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_avg_pool3d_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bernoulli2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bitwise3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bmm1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_add_autotune_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_bucketize_computed_offsets_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_buffer_copied_in_graph_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_builtins_round_float_ndigits_pos_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cat_extern_kernel_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_clamp_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_config_option_dont_assume_alignment_recompiles_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_constant_pad_2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv2d_channels_last_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_conv_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_convolution3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_cumsum_no_mask_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_custom_op_fixed_layout_channels_last_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_custom_op_fixed_layout_sequential_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtype_mismatch_issue_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_dtype_sympy_expr_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_empty1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_erfc_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fallback_mutable_op_no_mutated_tensors_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fallback_mutable_op_with_return_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fft_real_input_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_fuse_tiled_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_gather3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_getitem_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_propagation_floordiv_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_as_masked_fill_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_index_put_reinplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_inductor_layout_optimization_input_mutations_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_kernel_names_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_kwargs_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_broadcast_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_pointwise_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_large_strided_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_linspace3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_log2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_max_pool2d_with_indices_backward5_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_mul_index_expr_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multilayer_any_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multilayer_prime_size_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_multilayer_sum_low_prec_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_one_hot_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_output_strides_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pad_view_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_permute1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_digamma_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_exp2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_expit_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_gammaincc_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_i0_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_i1e_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_shifted_chebyshev_polynomial_t_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_spherical_bessel_j0_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pointwise_xlogy_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_pow2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_randint_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_randn_generator_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_reduction_config_limit_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_relu_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_roll_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_scatter2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sdpa_unaligned_mask_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_shape_padding_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sigmoid_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_signbit_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sizehint_issue1_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_scatter2_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_slice_scatter_reinplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_softmax_one_kernel_loop_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_softmax_one_kernel_persist_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sort_transpose_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_cumprod_low_prec_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_with_list_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_split_with_sizes_with_unbacked_symints_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_sum_int_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tensor3_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_tensor_index_slice_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_to_device_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_to_memory_format_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_unfold_zero_dimension_tensor_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_bicubic2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_bilinear2d_b_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_nearest1d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_nearest2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_upsample_nearest3d_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesCpuTests::test_zeros_dynamic_shapes_cpu, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_AllenaiLongformerBase_repro_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test__unsafe_masked_index_put_accumulate_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adaptive_max_pool2d2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_add_complex6_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_adding_tensor_offsets_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_alexnet_prefix_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_aoti_eager_override_registration_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_aoti_eager_support_str_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_aoti_eager_with_scalar_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_arange1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool2d_backward2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_avg_pool3d_backward3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_baddbmm_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_batch_norm_2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bernoulli1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bernoulli2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bitwise3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bmm2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_bucketize_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_builtins_round_float_ndigits_neg_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cat_upcasting_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_clamp_type_promotion_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_const_int32_to_float_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_constant_pad_fill_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv_bn_fuse_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv_functional_bn_fuse_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_conv_inference_heuristics_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_cumsum_inf_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_custom_op_1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div6_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_div7_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_elu_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_embedding_bag_byte_unpack_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_empty1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fft_real_input_real_output_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_floordiv_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fmin_fmax_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_fmod_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_forced_buffer_realize_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_grid_sampler_2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_abs_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_propagation_device_assert_masked_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_failed_reinplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_index_put_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_inf_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_large_grid_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_lerp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linear1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_linear_mixed_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_list_clearing_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_log_fp64_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_logcumsumexp_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_long_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_min_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d8_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_max_pool2d_with_indices_backward6_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_min_max_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_misaligned_address_issue1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_new_ones_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_nll_loss_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_one_hot_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_output_strides_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pixel_shuffle_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_entr_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_erfcx_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_exp2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_i0e_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_log1p_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_log_ndtr_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_modified_bessel_i0_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_modified_bessel_i1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_multigammaln_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_pointwise_shifted_chebyshev_polynomial_w_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_prod_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_profiler_mark_wrapper_call_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_randint_int64_mod_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reflection_pad2d_backward_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reflection_pad2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_reinterpret_dtypeview_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_remove_no_ops_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_repeat_as_strided_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scaled_dot_product_attention_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter5_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_bf16_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_scatter_reduce2_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_False_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_mutation3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_slice_scatter3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_softmax_one_kernel_persist_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_sort_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_cumprod_low_prec_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_with_sizes_with_unbacked_symints_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_split_with_unbacked_symints_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_tensor1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_to_device_constant_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_transpose_add_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unbacked_floordiv_simplify_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unbacked_floordiv_simplify_errors_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unbind_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_unroll_small_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_upsample_bilinear2d_a_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_upsample_nearest1d_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_vectorized_ops_masked_var_novec_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_vertical_fusion1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views1_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views3_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_views4_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_where_with_logical_op_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::DynamicShapesGPUTests::test_zeros_dynamic_shapes_cuda, test/inductor/test_torchinductor_dynamic_shapes.py::TestInductorDynamicCUDA::test_item_return_cuda 2024-08-20T22:48:08.6414387Z 2024-08-20T22:48:11.7751338Z Running inductor/test_aot_inductor 7/16 ... [2024-08-20 22:48:11.774501] 2024-08-20T22:48:11.7752104Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:48:11.7754129Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '-m', 'not serial', '--shard-id=7', '--num-shards=16', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:48:11.774859] 2024-08-20T22:53:26.8794014Z 2024-08-20T22:53:26.8795841Z inductor/test_mkldnn_pattern_matcher 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_mkldnn_pattern_matcher_1.1_2ceeacc2a014a25b_.log 2024-08-20T22:53:26.8837371Z Running 87 items in this shard: test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv2d_add_scalar, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv2d_binary, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv2d_binary_fusion_failed, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv2d_binary_inplace_fusion_failed_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv2d_binary_inplace_fusion_pass_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv2d_unary_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv3d_binary, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv3d_unary_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv_transpose2d_unary_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_conv_transpose3d_unary_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_dynamic_qlinear_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_dynamic_qlinear_input_dim_exceeds_2, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_dynamic_qlinear_qat_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_hardtanh_pattern_fallback, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_leaky_relu_pattern_fallback, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_linear_add_bias, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_linear_binary, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_linear_fp32, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_linear_unary, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_multi_linear_share_same_input, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qat_qconv2d, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qat_qconv2d_add, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qat_qconv2d_add_relu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qat_qconv2d_hardswish, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qat_qconv2d_hardtanh, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qat_qconv2d_relu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qat_qconv2d_relu6, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qat_qconv2d_silu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qcat, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_add_2, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_add_3, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_add_broadcast_shapes_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_add_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_add_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_add_relu_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_add_relu_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_dequant_promotion_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_hardswish_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_hardswish_int8_mixed_bf16_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_hardtanh_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_hardtanh_int8_mixed_bf16_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_relu6_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_relu_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_relu_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_silu_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qconv2d_silu_int8_mixed_bf16_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qflatten, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_add_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_add_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_add_relu_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_add_relu_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_dequant_promotion_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_dequant_promotion_cpu_input_dim_exceeds_2, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_dequant_promotion_dynamic_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_dequant_promotion_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_dequant_promotion_int8_mixed_bf16_input_dim_exceeds_2, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_gelu_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_gelu_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_input_dim_exceeds_2, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_input_dim_exceeds_2_and_not_contiguous, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_int8_mixed_bf16_input_dim_exceeds_2, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_int8_mixed_bf16_input_dim_exceeds_2_and_not_contiguous, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_mul_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_relu_cpu, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_relu_input_dim_exceeds_2, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_relu_int8_mixed_bf16, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qlinear_relu_int8_mixed_bf16_input_dim_exceeds_2, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_qmaxpool2d, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_reproduce_113440_issue_1, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_reproduce_113440_issue_2, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_reproduce_121253_issue, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_reproduce_99842_issue, test/inductor/test_mkldnn_pattern_matcher.py::TestPatternMatcher::test_woq_int8, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_conv2d_binary_dynamic_shapes, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_conv2d_unary_dynamic_shapes, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_conv3d_binary_dynamic_shapes, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_conv3d_unary_dynamic_shapes, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_conv_transpose2d_dynamic_shapes, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_linear_unary_dynamic_shapes, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_multi_linear_share_same_input_dynamic, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_q_attention_block, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_qat_bn_conv2d, test/inductor/test_mkldnn_pattern_matcher.py::TestDynamicPatternMatcher::test_qconv2d_maxpool2d_linear_dynamic_cpu 2024-08-20T22:53:26.8876357Z 2024-08-20T22:53:30.0579323Z Running inductor/test_aot_inductor 14/16 ... [2024-08-20 22:53:30.057331] 2024-08-20T22:53:30.0580134Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:53:30.0582025Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '-m', 'not serial', '--shard-id=14', '--num-shards=16', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:53:30.057715] 2024-08-20T22:56:02.5981273Z 2024-08-20T22:56:02.5983371Z inductor/test_aot_inductor 6/16 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_6.16_95daac6689370c5a_.log 2024-08-20T22:56:02.6027982Z Running 65 items in this shard: test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_non_tensor_predicates_dynamic_False_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_freezing_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_index_put_with_none_index_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_misaligned_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_poi_multiple_dynamic_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_replicate_on_devices_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scatter_reduce_fallback_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_dynamic_shape_with_div_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_True_autotune_True_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_False_autotune_True_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_True_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_False_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_with_profiler_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_grid_with_backed_symbols_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_addmm_multiple_dynamic_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_buffer_mutation_3_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_buffer_reuse_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_non_tensor_predicates_dynamic_True_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_simple_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_constant_folding_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_embedding_bag_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fx_gm_return_tuple_validation_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_scaled_dot_product_efficient_attention_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_sdpa_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_shifted_constraint_ranges_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_2_num_dims_2_dynamic_True_autotune_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_buffer_mutation_4_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_buffer_reuse_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_cond_with_reinterpret_view_inputs_outputs_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_constant_original_fqn_and_dtype_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_inf_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_int_list_input_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_multiple_output_alias_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_no_args_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_repeat_interleave_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_scaled_dot_product_efficient_attention_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_shifted_constraint_ranges_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_1_num_dims_2_dynamic_False_autotune_False_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_False_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_bmm_multiple_dynamic_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_buffer_reuse_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_cond_use_buffers_from_outer_scope_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_fft_c2c_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_output_misaligned_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_repeated_user_defined_triton_kernel_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_scatter_reduce_fallback_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_simple_split_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_True_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_True_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_addmm_multiple_dynamic_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_buffer_mutation_1_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_fx_gm_return_tuple_validation_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_poi_multiple_dynamic_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_replicate_on_devices_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_equal_to_1_arg_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_False_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_False_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_False_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_sympy_fn_like_arg_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_while_loop_nested_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_zero_grid_with_unbacked_symbols_non_abi_compatible_cuda 2024-08-20T22:56:02.6071512Z 2024-08-20T22:56:05.7830052Z Running inductor/test_cpu_cpp_wrapper 1/1 ... [2024-08-20 22:56:05.782263] 2024-08-20T22:56:05.7830705Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T22:56:05.7832409Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cpu_cpp_wrapper.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-08-20 22:56:05.782637] 2024-08-20T22:56:12.4711885Z 2024-08-20T22:56:12.4713763Z inductor/test_cpu_cpp_wrapper 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cpu_cpp_wrapper_1.1_7eb9a7d851dfc193_.log 2024-08-20T22:56:12.4714721Z 2024-08-20T23:12:58.1371955Z 2024-08-20T23:12:58.1375758Z PRINTING LOG FILE of inductor/test_aot_inductor 7/16 (test/test-reports/inductor.test_aot_inductor_7.16_5ea37599c1ce7c3e_.log) 2024-08-20T23:12:58.1378135Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-23f02f0912cc5941.xml 2024-08-20T23:12:58.1379551Z ============================= test session starts ============================== 2024-08-20T23:12:58.1380644Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.5.0 -- /opt/conda/envs/py_3.10/bin/python 2024-08-20T23:12:58.1381412Z cachedir: .pytest_cache 2024-08-20T23:12:58.1382419Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2024-08-20T23:12:58.1383939Z rootdir: /var/lib/jenkins/workspace 2024-08-20T23:12:58.1384498Z configfile: pytest.ini 2024-08-20T23:12:58.1385598Z plugins: hypothesis-5.35.1, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, xdist-3.3.1, xdoctest-1.1.0, typeguard-4.3.0 2024-08-20T23:12:58.1386630Z collecting ... collected 912 items 2024-08-20T23:12:58.1387138Z stepcurrent: Cannot find last run test, not skipping 2024-08-20T23:12:58.1429546Z Running 58 items in this shard: test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_amp_fallback_random_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_outer_code_before_after_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_parameters_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_index_put_fallback_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_int_list_input_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_grid_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_multi_device_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_path_1_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_return_view_constant_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scatter_fallback_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_equal_to_1_float_arg_dynamic_True_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_True_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_reinterpret_view_mem_leak_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_addmm_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_buffer_mutation_1_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_nested_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_with_parameters_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fp8_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_linear_freezing_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_runtime_checks_complex_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_scatter_fallback_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_sympy_expr_arg_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_False_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_cond_non_tensor_predicates_dynamic_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_constant_original_fqn_and_dtype_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_dynamic_smem_above_default_limit_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_fx_gm_return_tuple_validation_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_missing_output_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_non_tensor_input_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_view_outputs_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_while_loop_simple_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_cond_non_tensor_predicates_dynamic_True_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_dynamic_smem_above_default_limit_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_fake_tensor_device_validation_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_missing_cubin_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_zero_size_weight_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_buffer_mutation_4_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_int_list_input_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_misc_1_max_autotune_True_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_model_modified_weights_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_output_path_1_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_quantized_linear_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_reuse_kernel_dynamic_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_runtime_checks_fp8_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_dynamic_shape_with_div_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_zero_size_weight_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_buffer_mutation_2_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_int_list_input_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_nested_tensor_from_jagged_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_scatter_reduce_fallback_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_equal_to_1_float_arg_dynamic_True_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_True_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_False_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_unsupported_input_dtype_non_abi_compatible_cuda 2024-08-20T23:12:58.1470759Z 2024-08-20T23:12:58.1471942Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_amp_fallback_random_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [8.2149s] [ 1%] 2024-08-20T23:12:58.1473965Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_outer_code_before_after_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.8394s] [ 3%] 2024-08-20T23:12:58.1475969Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_parameters_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [8.2270s] [ 5%] 2024-08-20T23:12:58.1477884Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_index_put_fallback_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.2904s] [ 6%] 2024-08-20T23:12:58.1479832Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_int_list_input_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.1218s] [ 8%] 2024-08-20T23:12:58.1481784Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_grid_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (requires CUDA) [ 10%] 2024-08-20T23:12:58.1483936Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_multi_device_abi_compatible_cpu <- test/inductor/test_torchinductor.py W0820 22:48:56.365000 41699 torch/_inductor/utils.py:1391] DeviceCopy in input program 2024-08-20T23:12:58.1485284Z PASSED [10.1195s] [ 12%] 2024-08-20T23:12:58.1486501Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_path_1_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.2191s] [ 13%] 2024-08-20T23:12:58.1488833Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_return_view_constant_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [6.7304s] [ 15%] 2024-08-20T23:12:58.1490714Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scatter_fallback_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.3616s] [ 17%] 2024-08-20T23:12:58.1493045Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_equal_to_1_float_arg_dynamic_True_abi_compatible_cpu SKIPPED [0.0033s] (requires CUDA) [ 18%] 2024-08-20T23:12:58.1495049Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_True_abi_compatible_cpu SKIPPED [0.0032s] (requires CUDA) [ 20%] 2024-08-20T23:12:58.1496999Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_reinterpret_view_mem_leak_abi_compatible_cpu SKIPPED [0.0031s] (requires CUDA) [ 22%] 2024-08-20T23:12:58.1499047Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_addmm_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [7.3839s] [ 24%] 2024-08-20T23:12:58.1501577Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_buffer_mutation_1_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 25%] 2024-08-20T23:12:58.1503927Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_nested_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [10.3433s] [ 27%] 2024-08-20T23:12:58.1506271Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_with_parameters_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [8.0608s] [ 29%] 2024-08-20T23:12:58.1508499Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fp8_abi_compatible_cpu_with_stack_allocation SKIPPED [0.0002s] (FP8 is only supported on H100+) [ 31%] 2024-08-20T23:12:58.1510778Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_linear_freezing_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 32%] 2024-08-20T23:12:58.1513222Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_runtime_checks_complex_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 34%] 2024-08-20T23:12:58.1515684Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_scatter_fallback_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 36%] 2024-08-20T23:12:58.1518198Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu_with_stack_allocation SKIPPED [0.0032s] (requires CUDA) [ 37%] 2024-08-20T23:12:58.1520870Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_sympy_expr_arg_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0031s] (requires CUDA) [ 39%] 2024-08-20T23:12:58.1523426Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_False_abi_compatible_cpu_with_stack_allocation SKIPPED [0.0033s] (requires CUDA) [ 41%] 2024-08-20T23:12:58.1526129Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_cond_non_tensor_predicates_dynamic_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0001s] (Skipped!) [ 43%] 2024-08-20T23:12:58.1529171Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_constant_original_fqn_and_dtype_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py PASSED [15.4658s] [ 44%] 2024-08-20T23:12:58.1532626Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_dynamic_smem_above_default_limit_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0002s] (Test was marked as expected failure, but does not fail always anymore.) [ 46%] 2024-08-20T23:12:58.1535886Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_fx_gm_return_tuple_validation_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py PASSED [0.0125s] [ 48%] 2024-08-20T23:12:58.1539017Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_missing_output_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 50%] 2024-08-20T23:12:58.1542318Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_non_tensor_input_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py W0820 22:50:09.069000 41699 torch/_dynamo/eval_frame.py:265] could not determine __code__ for aten.add 2024-08-20T23:12:58.1544454Z W0820 22:50:09.082000 41699 torch/_dynamo/eval_frame.py:265] could not determine __code__ for aten.add 2024-08-20T23:12:58.1545444Z W0820 22:50:23.198000 41699 torch/_dynamo/eval_frame.py:265] could not determine __code__ for aten.add 2024-08-20T23:12:58.1546487Z W0820 22:50:23.211000 41699 torch/_dynamo/eval_frame.py:265] could not determine __code__ for aten.add 2024-08-20T23:12:58.1547255Z PASSED [29.5111s] [ 51%] 2024-08-20T23:12:58.1549159Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0031s] (requires CUDA) [ 53%] 2024-08-20T23:12:58.1552258Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0030s] (requires CUDA) [ 55%] 2024-08-20T23:12:58.1555244Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_view_outputs_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py PASSED [7.3641s] [ 56%] 2024-08-20T23:12:58.1558298Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_while_loop_simple_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 58%] 2024-08-20T23:12:58.1561748Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_cond_non_tensor_predicates_dynamic_True_abi_compatible_cuda W0820 22:50:45.967000 41699 torch/_higher_order_ops/cond.py:116] Pred is a Python constant. When used with torch.cond, it executes only one of the branches. If you want torch.cond to perserve two branches, please make the predicate a boolean tensor or a SymBool. 2024-08-20T23:12:58.1564805Z W0820 22:50:45.976000 41699 torch/_higher_order_ops/cond.py:116] Pred is a Python constant. When used with torch.cond, it executes only one of the branches. If you want torch.cond to perserve two branches, please make the predicate a boolean tensor or a SymBool. 2024-08-20T23:12:58.1566383Z PASSED [8.9000s] [ 60%] 2024-08-20T23:12:58.1568478Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_dynamic_smem_above_default_limit_abi_compatible_cuda SKIPPED [0.0003s] (Test was marked as expected failure, but does not fail always anymore.) [ 62%] 2024-08-20T23:12:58.1570722Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_fake_tensor_device_validation_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [0.0590s] [ 63%] 2024-08-20T23:12:58.1572674Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_missing_cubin_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [29.0995s] [ 65%] 2024-08-20T23:12:58.1579408Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_zero_size_weight_abi_compatible_cuda <- test/inductor/test_torchinductor.py /tmp/tmp28vmpzrk/cfs4xeeqht7ut4genqxabrexac2ub7jtnl5rphhzpvx3kutkhxgu/c4ydrks5qehnrfatp3kdoozfr7kppb3tng73uzao3fymqztglbxk.cpp: In member function ‘void torch::aot_inductor::AOTInductorModel::run_impl(AtenTensorOpaque**, AtenTensorOpaque**, torch::aot_inductor::DeviceStreamType, AOTIProxyExecutorHandle)’: 2024-08-20T23:12:58.1583256Z /tmp/tmp28vmpzrk/cfs4xeeqht7ut4genqxabrexac2ub7jtnl5rphhzpvx3kutkhxgu/c4ydrks5qehnrfatp3kdoozfr7kppb3tng73uzao3fymqztglbxk.cpp:603:10: warning: variable ‘L__self___net_0_weight’ set but not used [-Wunused-but-set-variable] 2024-08-20T23:12:58.1584776Z 603 | auto L__self___net_0_weight = constants_->at(0); 2024-08-20T23:12:58.1585261Z | ^~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1585743Z PASSED [8.0364s] [ 67%] 2024-08-20T23:12:58.1587123Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_buffer_mutation_4_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0035s] (requires CUDA) [ 68%] 2024-08-20T23:12:58.1589149Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_int_list_input_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.4647s] [ 70%] 2024-08-20T23:12:58.1591069Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_misc_1_max_autotune_True_non_abi_compatible_cpu E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:12:58.1592412Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.1593132Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:12:58.1602680Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.so 2024-08-20T23:12:58.1611314Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.1612279Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:12:58.1614238Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1617365Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.1619527Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.1620588Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.1622650Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1625792Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.1628013Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1629181Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1631264Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.1633933Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.1636458Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1638128Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.1639075Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1640104Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.1642365Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.1644838Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:245:29: required from here 2024-08-20T23:12:58.1647122Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1649237Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1650478Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1651558Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.1653887Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.1656506Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:258:29: required from here 2024-08-20T23:12:58.1658774Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1660606Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1661831Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1662922Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.1664479Z E0820 22:51:49.368000 41699 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:12:58.1666026Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:12:58.1666941Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:12:58.1668192Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.1668965Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:12:58.1678226Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.so 2024-08-20T23:12:58.1687302Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.1688041Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:12:58.1690000Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1693121Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.1695324Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.1696407Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.1698465Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1701571Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.1703776Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1704955Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1707035Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.1709709Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.1712247Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1713914Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.1714866Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1715784Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.1718045Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.1720653Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:245:29: required from here 2024-08-20T23:12:58.1723125Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1724966Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1726200Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1727269Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.1729702Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.1732180Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:258:29: required from here 2024-08-20T23:12:58.1734443Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/ig/cigntvta32fon65o2mot43lfya2dlgbkhh774pj7k2m2hua7l7qv.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1736280Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1737506Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1738590Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.1739445Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:12:58.1740240Z E0820 22:51:51.073000 41699 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:12:58.1741152Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:12:58.1741976Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.1742705Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:12:58.1752048Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.so 2024-08-20T23:12:58.1760937Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.1761673Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:12:58.1763613Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1766722Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.1769318Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.1770410Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.1772473Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1775606Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.1777803Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1778988Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1781064Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.1783735Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.1786274Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1787940Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.1788898Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1789803Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.1792063Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.1794565Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:201:29: required from here 2024-08-20T23:12:58.1797308Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1799160Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1800517Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1801611Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.1803961Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.1806659Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:214:29: required from here 2024-08-20T23:12:58.1808932Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1810757Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1811975Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1813058Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.1814615Z E0820 22:51:52.710000 41699 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:12:58.1816134Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:12:58.1817048Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:12:58.1817817Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.1818540Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:12:58.1827950Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.so 2024-08-20T23:12:58.1836457Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.1837178Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:12:58.1839095Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1842464Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.1844510Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.1845567Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.1847598Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1850712Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.1852901Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1854062Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1856183Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.1858849Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.1861388Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1863048Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.1863977Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1864870Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.1867177Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.1870217Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:201:29: required from here 2024-08-20T23:12:58.1872855Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1874703Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1875977Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1877056Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.1879540Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.1882128Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:214:29: required from here 2024-08-20T23:12:58.1884407Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vr/cvr2shce65qvhoiccj2l7wergidk24za7s7ow5hylok5zo5utcex.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1886289Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1887522Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1888599Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.1889465Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:12:58.1890253Z E0820 22:51:54.357000 41699 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:12:58.1891167Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:12:58.1891984Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.1892700Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:12:58.1902221Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.so 2024-08-20T23:12:58.1910834Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.1911552Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:12:58.1913478Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1916602Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.1918802Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.1919987Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.1922041Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1925178Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.1927425Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1928605Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1930684Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.1933379Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.1935938Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1937605Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.1938542Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1939438Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.1941722Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.1944215Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:201:29: required from here 2024-08-20T23:12:58.1946745Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1948599Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1949817Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1950889Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.1953461Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.1955997Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:214:29: required from here 2024-08-20T23:12:58.1958277Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.1960279Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.1961516Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.1962589Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.1964137Z E0820 22:51:55.988000 41699 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:12:58.1965666Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:12:58.1966576Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:12:58.1967335Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.1968425Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:12:58.1978257Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.so 2024-08-20T23:12:58.1986796Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.1987512Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:12:58.1989462Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.1992737Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.1994787Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.1995840Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.1997892Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2001153Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.2003366Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2004532Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2006606Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.2009292Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.2011855Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2013514Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.2014443Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2015336Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2017607Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.2020098Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:201:29: required from here 2024-08-20T23:12:58.2022614Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2024456Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2025673Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2026853Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2029199Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.2031672Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:214:29: required from here 2024-08-20T23:12:58.2033940Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] /tmp/tmpibuj8djl/vl/cvlc2knbtd5h3wzcbeimb6mdp272wromzpiwjpes2sixcrq37jhl.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2035784Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2037009Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2038095Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2038944Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:12:58.2039860Z E0820 22:51:57.665000 41699 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:12:58.2040589Z PASSED [25.9667s] [ 72%] 2024-08-20T23:12:58.2041922Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_model_modified_weights_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [16.1824s] [ 74%] 2024-08-20T23:12:58.2043889Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_output_path_1_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.5041s] [ 75%] 2024-08-20T23:12:58.2046179Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_quantized_linear_non_abi_compatible_cpu [W820 22:52:45.219446503 QuantizedLinear.cpp:383] Warning: fbgemm_pack_gemm_matrix_fp16 is deprecated and will be removed in a future PyTorch release. (function operator()) 2024-08-20T23:12:58.2048378Z [W820 22:53:00.149806725 QuantizedLinear.cpp:418] Warning: fbgemm_linear_fp16_weight_fp32_activation is deprecated and will be removed in a future PyTorch release. (function operator()) 2024-08-20T23:12:58.2049515Z PASSED [14.9995s] [ 77%] 2024-08-20T23:12:58.2050803Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_reuse_kernel_dynamic_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [17.0787s] [ 79%] 2024-08-20T23:12:58.2052758Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_runtime_checks_fp8_non_abi_compatible_cpu SKIPPED [0.0002s] (FP8 is only supported on H100+) [ 81%] 2024-08-20T23:12:58.2055053Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_dynamic_shape_with_div_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (requires CUDA) [ 82%] 2024-08-20T23:12:58.2057330Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_non_abi_compatible_cpu SKIPPED [0.0032s] (requires CUDA) [ 84%] 2024-08-20T23:12:58.2059377Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_zero_size_weight_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.5691s] [ 86%] 2024-08-20T23:12:58.2061343Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_buffer_mutation_2_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [16.4134s] [ 87%] 2024-08-20T23:12:58.2063406Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_int_list_input_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.9759s] [ 89%] 2024-08-20T23:12:58.2065417Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_nested_tensor_from_jagged_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [16.7431s] [ 91%] 2024-08-20T23:12:58.2067515Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_scatter_reduce_fallback_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [16.2062s] [ 93%] 2024-08-20T23:12:58.2069847Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_equal_to_1_float_arg_dynamic_True_non_abi_compatible_cuda PASSED [16.3569s] [ 94%] 2024-08-20T23:12:58.2071777Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_True_non_abi_compatible_cuda PASSED [21.3354s] [ 96%] 2024-08-20T23:12:58.2073773Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_False_non_abi_compatible_cuda PASSED [16.2132s] [ 98%] 2024-08-20T23:12:58.2075579Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_unsupported_input_dtype_non_abi_compatible_cuda PASSED [0.2995s] [100%] 2024-08-20T23:12:58.2076416Z 2024-08-20T23:12:58.2077305Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-23f02f0912cc5941.xml - 2024-08-20T23:12:58.2078720Z ================== 35 passed, 23 skipped in 434.87s (0:07:14) ================== 2024-08-20T23:12:58.2079449Z Got exit code -11 (SIGSEGV) 2024-08-20T23:12:58.2079892Z Retrying single test... 2024-08-20T23:12:58.2080822Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-e2301ff56e81127e.xml 2024-08-20T23:12:58.2081873Z ============================= test session starts ============================== 2024-08-20T23:12:58.2082771Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.5.0 -- /opt/conda/envs/py_3.10/bin/python 2024-08-20T23:12:58.2083458Z cachedir: .pytest_cache 2024-08-20T23:12:58.2084315Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2024-08-20T23:12:58.2085141Z rootdir: /var/lib/jenkins/workspace 2024-08-20T23:12:58.2085539Z configfile: pytest.ini 2024-08-20T23:12:58.2086387Z plugins: hypothesis-5.35.1, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, xdist-3.3.1, xdoctest-1.1.0, typeguard-4.3.0 2024-08-20T23:12:58.2087291Z collecting ... collected 912 items 2024-08-20T23:12:58.2087800Z stepcurrent: Cannot find last run test, not skipping 2024-08-20T23:12:58.2088285Z Running 58 items in this shard 2024-08-20T23:12:58.2088534Z 2024-08-20T23:12:58.2089746Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_amp_fallback_random_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.9213s] [ 1%] 2024-08-20T23:12:58.2091724Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_outer_code_before_after_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.6231s] [ 3%] 2024-08-20T23:12:58.2093683Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_parameters_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [8.0918s] [ 5%] 2024-08-20T23:12:58.2095590Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_index_put_fallback_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.1094s] [ 6%] 2024-08-20T23:12:58.2097645Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_int_list_input_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.0817s] [ 8%] 2024-08-20T23:12:58.2099590Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_grid_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (requires CUDA) [ 10%] 2024-08-20T23:12:58.2101748Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_multi_device_abi_compatible_cpu <- test/inductor/test_torchinductor.py W0820 23:01:18.032000 47936 torch/_inductor/utils.py:1391] DeviceCopy in input program 2024-08-20T23:12:58.2103093Z PASSED [9.8383s] [ 12%] 2024-08-20T23:12:58.2104302Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_path_1_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.1332s] [ 13%] 2024-08-20T23:12:58.2106227Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_return_view_constant_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [6.3902s] [ 15%] 2024-08-20T23:12:58.2108158Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scatter_fallback_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.0938s] [ 17%] 2024-08-20T23:12:58.2110061Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_equal_to_1_float_arg_dynamic_True_abi_compatible_cpu SKIPPED [0.0032s] (requires CUDA) [ 18%] 2024-08-20T23:12:58.2112058Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_True_abi_compatible_cpu SKIPPED [0.0030s] (requires CUDA) [ 20%] 2024-08-20T23:12:58.2114012Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_reinterpret_view_mem_leak_abi_compatible_cpu SKIPPED [0.0030s] (requires CUDA) [ 22%] 2024-08-20T23:12:58.2116082Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_addmm_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [7.1379s] [ 24%] 2024-08-20T23:12:58.2118422Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_buffer_mutation_1_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 25%] 2024-08-20T23:12:58.2120861Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_nested_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [10.2134s] [ 27%] 2024-08-20T23:12:58.2123179Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_cond_with_parameters_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [8.0551s] [ 29%] 2024-08-20T23:12:58.2125412Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fp8_abi_compatible_cpu_with_stack_allocation SKIPPED [0.0002s] (FP8 is only supported on H100+) [ 31%] 2024-08-20T23:12:58.2127810Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_linear_freezing_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 32%] 2024-08-20T23:12:58.2130281Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_runtime_checks_complex_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 34%] 2024-08-20T23:12:58.2132731Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_scatter_fallback_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 36%] 2024-08-20T23:12:58.2135306Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu_with_stack_allocation SKIPPED [0.0032s] (requires CUDA) [ 37%] 2024-08-20T23:12:58.2137839Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_sympy_expr_arg_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (requires CUDA) [ 39%] 2024-08-20T23:12:58.2140391Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_False_abi_compatible_cpu_with_stack_allocation SKIPPED [0.0034s] (requires CUDA) [ 41%] 2024-08-20T23:12:58.2143118Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_cond_non_tensor_predicates_dynamic_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0001s] (Skipped!) [ 43%] 2024-08-20T23:12:58.2146113Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_constant_original_fqn_and_dtype_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py PASSED [15.1001s] [ 44%] 2024-08-20T23:12:58.2149374Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_dynamic_smem_above_default_limit_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0002s] (Test was marked as expected failure, but does not fail always anymore.) [ 46%] 2024-08-20T23:12:58.2152593Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_fx_gm_return_tuple_validation_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py PASSED [0.0124s] [ 48%] 2024-08-20T23:12:58.2155641Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_missing_output_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 50%] 2024-08-20T23:12:58.2159085Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_non_tensor_input_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py W0820 23:02:29.015000 47936 torch/_dynamo/eval_frame.py:265] could not determine __code__ for aten.add 2024-08-20T23:12:58.2161351Z W0820 23:02:29.027000 47936 torch/_dynamo/eval_frame.py:265] could not determine __code__ for aten.add 2024-08-20T23:12:58.2162339Z W0820 23:02:42.795000 47936 torch/_dynamo/eval_frame.py:265] could not determine __code__ for aten.add 2024-08-20T23:12:58.2163307Z W0820 23:02:42.807000 47936 torch/_dynamo/eval_frame.py:265] could not determine __code__ for aten.add 2024-08-20T23:12:58.2164186Z PASSED [28.6086s] [ 51%] 2024-08-20T23:12:58.2166096Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0030s] (requires CUDA) [ 53%] 2024-08-20T23:12:58.2169513Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0027s] (requires CUDA) [ 55%] 2024-08-20T23:12:58.2172753Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_view_outputs_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py PASSED [7.1329s] [ 56%] 2024-08-20T23:12:58.2175751Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_while_loop_simple_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 58%] 2024-08-20T23:12:58.2179024Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_cond_non_tensor_predicates_dynamic_True_abi_compatible_cuda W0820 23:03:04.773000 47936 torch/_higher_order_ops/cond.py:116] Pred is a Python constant. When used with torch.cond, it executes only one of the branches. If you want torch.cond to perserve two branches, please make the predicate a boolean tensor or a SymBool. 2024-08-20T23:12:58.2182094Z W0820 23:03:04.780000 47936 torch/_higher_order_ops/cond.py:116] Pred is a Python constant. When used with torch.cond, it executes only one of the branches. If you want torch.cond to perserve two branches, please make the predicate a boolean tensor or a SymBool. 2024-08-20T23:12:58.2183607Z PASSED [8.3935s] [ 60%] 2024-08-20T23:12:58.2185102Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_dynamic_smem_above_default_limit_abi_compatible_cuda SKIPPED [0.0002s] (Test was marked as expected failure, but does not fail always anymore.) [ 62%] 2024-08-20T23:12:58.2187293Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_fake_tensor_device_validation_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [0.0355s] [ 63%] 2024-08-20T23:12:58.2189252Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_missing_cubin_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [28.2231s] [ 65%] 2024-08-20T23:12:58.2192638Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_zero_size_weight_abi_compatible_cuda <- test/inductor/test_torchinductor.py /tmp/tmp60_6hri1/cfs4xeeqht7ut4genqxabrexac2ub7jtnl5rphhzpvx3kutkhxgu/ck25qjtkktjd5nrv5igklaul2uu675f7xybtqgi5xcg2bqopp7wz.cpp: In member function ‘void torch::aot_inductor::AOTInductorModel::run_impl(AtenTensorOpaque**, AtenTensorOpaque**, torch::aot_inductor::DeviceStreamType, AOTIProxyExecutorHandle)’: 2024-08-20T23:12:58.2196169Z /tmp/tmp60_6hri1/cfs4xeeqht7ut4genqxabrexac2ub7jtnl5rphhzpvx3kutkhxgu/ck25qjtkktjd5nrv5igklaul2uu675f7xybtqgi5xcg2bqopp7wz.cpp:603:10: warning: variable ‘L__self___net_0_weight’ set but not used [-Wunused-but-set-variable] 2024-08-20T23:12:58.2197654Z 603 | auto L__self___net_0_weight = constants_->at(0); 2024-08-20T23:12:58.2198154Z | ^~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2198626Z PASSED [7.7682s] [ 67%] 2024-08-20T23:12:58.2200103Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_buffer_mutation_4_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0034s] (requires CUDA) [ 68%] 2024-08-20T23:12:58.2202374Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_int_list_input_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [14.9510s] [ 70%] 2024-08-20T23:12:58.2204298Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_misc_1_max_autotune_True_non_abi_compatible_cpu E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:12:58.2205655Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.2206380Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:12:58.2215741Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.so 2024-08-20T23:12:58.2224202Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.2224924Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:12:58.2226825Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2229887Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.2231905Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.2232965Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.2234980Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2238091Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.2240507Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2241681Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2243721Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.2246374Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.2248952Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2250580Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.2251515Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2252400Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.2254636Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.2257119Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:245:29: required from here 2024-08-20T23:12:58.2259491Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2261301Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2262519Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2263601Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.2265896Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.2268833Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:258:29: required from here 2024-08-20T23:12:58.2271223Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2273038Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2274282Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2275672Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.2277243Z E0820 23:04:05.945000 47936 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:12:58.2278789Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:12:58.2279719Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:12:58.2280597Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.2281473Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:12:58.2290743Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.so 2024-08-20T23:12:58.2299295Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.2300023Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:12:58.2301939Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2305023Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.2307098Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.2308166Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.2310202Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2313288Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.2315790Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2316965Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2319020Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.2321898Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.2324388Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2326026Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.2326965Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2327870Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2330114Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.2332559Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:245:29: required from here 2024-08-20T23:12:58.2334772Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2336578Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2337810Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2338902Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2341214Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.2343632Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:258:29: required from here 2024-08-20T23:12:58.2345845Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/i4/ci4ia4uzdogc7vans2eqfyc7pwa4trj2qn5p26n5lu4zgl6kf5z3.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2347727Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2349152Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2350236Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2351095Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:12:58.2351891Z E0820 23:04:07.605000 47936 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:12:58.2352803Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:12:58.2353613Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.2354455Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:12:58.2363931Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.so 2024-08-20T23:12:58.2372945Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.2373678Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:12:58.2375640Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2378740Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.2380765Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.2381825Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.2383856Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2386955Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.2389498Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2390658Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2392709Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.2395540Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.2398043Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2399683Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.2400759Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2401651Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.2403901Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.2406356Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:201:29: required from here 2024-08-20T23:12:58.2408579Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2410403Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2411631Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2412724Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.2415039Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.2417518Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:214:29: required from here 2024-08-20T23:12:58.2419752Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2421581Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2422803Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2424089Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.2425640Z E0820 23:04:09.189000 47936 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:12:58.2427181Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:12:58.2428105Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:12:58.2428979Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.2429711Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:12:58.2439003Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.so 2024-08-20T23:12:58.2447695Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.2448427Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:12:58.2450349Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2453441Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.2455478Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.2456536Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.2458572Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2461891Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.2464071Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2465237Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2467301Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.2470926Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.2473445Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2475081Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.2476009Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2476914Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2479176Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.2481754Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:201:29: required from here 2024-08-20T23:12:58.2483990Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2485869Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2487100Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2488181Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2490481Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.2492899Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:214:29: required from here 2024-08-20T23:12:58.2495127Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/73/c73ijkjlaty2baggzbtthbrifg46uktjwuxtnjk4sngnzhhegfk6.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2497009Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2498652Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2499731Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2500590Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:12:58.2501380Z E0820 23:04:10.804000 47936 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:12:58.2502283Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:12:58.2503379Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.2504114Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:12:58.2513498Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.so 2024-08-20T23:12:58.2522185Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:12:58.2522920Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:12:58.2524848Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2528021Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.2530049Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.2531117Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.2533155Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2536677Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.2538863Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2540016Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2542085Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.2544852Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.2547380Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2549033Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.2549973Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2550888Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.2553156Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.2555630Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:201:29: required from here 2024-08-20T23:12:58.2557922Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2559991Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2561222Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2562319Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.2564641Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.2567085Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:214:29: required from here 2024-08-20T23:12:58.2569727Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2571560Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2573140Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2574222Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:12:58.2575785Z E0820 23:04:12.390000 47936 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:12:58.2577328Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:12:58.2578251Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:12:58.2579162Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.2579888Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:12:58.2589255Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lc10 -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.so 2024-08-20T23:12:58.2597808Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:12:58.2598534Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:12:58.2600625Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2603759Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:12:58.2605918Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:12:58.2606985Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:12:58.2609030Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:12:58.2612427Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:12:58.2614616Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2615777Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2617849Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:12:58.2620619Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:12:58.2623132Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2624773Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:12:58.2625709Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2626614Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2628891Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:12:58.2631351Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:201:29: required from here 2024-08-20T23:12:58.2633583Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2635414Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2636657Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2637733Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2640199Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:12:58.2642636Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:214:29: required from here 2024-08-20T23:12:58.2644878Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_6s5o9oh/th/cthgafaxfh6tjmggao2rv5ppytcibece57ffcg2cy3n4gfeeriel.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:12:58.2646717Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:12:58.2648150Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:12:58.2649229Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:12:58.2650100Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:12:58.2650908Z E0820 23:04:14.001000 47936 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:12:58.2651639Z PASSED [24.9332s] [ 72%] 2024-08-20T23:12:58.2653082Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_model_modified_weights_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.2515s] [ 74%] 2024-08-20T23:12:58.2655086Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_output_path_1_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.0260s] [ 75%] 2024-08-20T23:12:58.2657340Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_quantized_linear_non_abi_compatible_cpu [W820 23:04:59.371213431 QuantizedLinear.cpp:383] Warning: fbgemm_pack_gemm_matrix_fp16 is deprecated and will be removed in a future PyTorch release. (function operator()) 2024-08-20T23:12:58.2659557Z [W820 23:05:13.831765731 QuantizedLinear.cpp:418] Warning: fbgemm_linear_fp16_weight_fp32_activation is deprecated and will be removed in a future PyTorch release. (function operator()) 2024-08-20T23:12:58.2660721Z PASSED [14.5052s] [ 77%] 2024-08-20T23:12:58.2662043Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_reuse_kernel_dynamic_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [16.5829s] [ 79%] 2024-08-20T23:12:58.2664021Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_runtime_checks_fp8_non_abi_compatible_cpu SKIPPED [0.0002s] (FP8 is only supported on H100+) [ 81%] 2024-08-20T23:12:58.2666206Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_dynamic_shape_with_div_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (requires CUDA) [ 82%] 2024-08-20T23:12:58.2668759Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_non_abi_compatible_cpu SKIPPED [0.0030s] (requires CUDA) [ 84%] 2024-08-20T23:12:58.2670826Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_zero_size_weight_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.0526s] [ 86%] 2024-08-20T23:12:58.2672822Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_buffer_mutation_2_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.8014s] [ 87%] 2024-08-20T23:12:58.2674808Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_int_list_input_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.5119s] [ 89%] 2024-08-20T23:12:58.2676886Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_nested_tensor_from_jagged_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [16.3852s] [ 91%] 2024-08-20T23:12:58.2678943Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_scatter_reduce_fallback_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.6089s] [ 93%] 2024-08-20T23:12:58.2680976Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_equal_to_1_float_arg_dynamic_True_non_abi_compatible_cuda PASSED [15.7113s] [ 94%] 2024-08-20T23:12:58.2683188Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_True_non_abi_compatible_cuda PASSED [20.6697s] [ 96%] 2024-08-20T23:12:58.2685206Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_False_non_abi_compatible_cuda PASSED [15.9728s] [ 98%] 2024-08-20T23:12:58.2687087Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_unsupported_input_dtype_non_abi_compatible_cuda PASSED [0.2921s] [100%] 2024-08-20T23:12:58.2687941Z 2024-08-20T23:12:58.2688825Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-e2301ff56e81127e.xml - 2024-08-20T23:12:58.2690395Z ================== 35 passed, 23 skipped in 421.41s (0:07:01) ================== 2024-08-20T23:12:58.2691136Z Got exit code -11 (SIGSEGV) 2024-08-20T23:12:58.2691499Z Retrying single test... 2024-08-20T23:12:58.2692449Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-2235af948063f604.xml 2024-08-20T23:12:58.2693520Z ============================= test session starts ============================== 2024-08-20T23:12:58.2694429Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.5.0 -- /opt/conda/envs/py_3.10/bin/python 2024-08-20T23:12:58.2695129Z cachedir: .pytest_cache 2024-08-20T23:12:58.2696020Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2024-08-20T23:12:58.2696871Z rootdir: /var/lib/jenkins/workspace 2024-08-20T23:12:58.2697274Z configfile: pytest.ini 2024-08-20T23:12:58.2698143Z plugins: hypothesis-5.35.1, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, xdist-3.3.1, xdoctest-1.1.0, typeguard-4.3.0 2024-08-20T23:12:58.2699202Z collecting ... collected 912 items / 57 deselected / 855 selected 2024-08-20T23:12:58.2700488Z stepcurrent: skipping 57 already run items. Running only test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_unsupported_input_dtype_non_abi_compatible_cuda 2024-08-20T23:12:58.2701600Z Running 1 items in this shard 2024-08-20T23:12:58.2701859Z 2024-08-20T23:12:58.2702689Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_unsupported_input_dtype_non_abi_compatible_cuda PASSED [1.4061s] [100%] 2024-08-20T23:12:58.2703541Z 2024-08-20T23:12:58.2704429Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-2235af948063f604.xml - 2024-08-20T23:12:58.2705855Z ======================= 1 passed, 57 deselected in 1.49s ======================= 2024-08-20T23:12:58.2706512Z Got exit code 0 2024-08-20T23:12:58.2706994Z Test succeeeded in new process, continuing with the rest of the tests 2024-08-20T23:12:58.2708157Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-5c9d0442aa718d17.xml 2024-08-20T23:12:58.2709225Z ============================= test session starts ============================== 2024-08-20T23:12:58.2710130Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.5.0 -- /opt/conda/envs/py_3.10/bin/python 2024-08-20T23:12:58.2710831Z cachedir: .pytest_cache 2024-08-20T23:12:58.2711699Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2024-08-20T23:12:58.2712535Z rootdir: /var/lib/jenkins/workspace 2024-08-20T23:12:58.2712952Z configfile: pytest.ini 2024-08-20T23:12:58.2713815Z plugins: hypothesis-5.35.1, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, xdist-3.3.1, xdoctest-1.1.0, typeguard-4.3.0 2024-08-20T23:12:58.2714860Z collecting ... collected 912 items / 58 deselected / 854 selected 2024-08-20T23:12:58.2715464Z stepcurrent: skipping 58 already run items. 2024-08-20T23:12:58.2716082Z Running 0 items in this shard 2024-08-20T23:12:58.2716333Z 2024-08-20T23:12:58.2717221Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-5c9d0442aa718d17.xml - 2024-08-20T23:12:58.2718576Z ============================ 58 deselected in 0.07s ============================ 2024-08-20T23:12:58.2720357Z The following tests failed and then succeeded when run in a new process['ul', 'test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_unsupported_input_dtype_non_abi_compatible_cuda'] 2024-08-20T23:12:58.2721538Z 2024-08-20T23:12:58.2722267Z FINISHED PRINTING LOG FILE of inductor/test_aot_inductor 7/16 (test/test-reports/inductor.test_aot_inductor_7.16_5ea37599c1ce7c3e_.log) 2024-08-20T23:12:58.2723014Z 2024-08-20T23:19:45.9512666Z 2024-08-20T23:19:45.9514250Z PRINTING LOG FILE of inductor/test_aot_inductor 14/16 (test/test-reports/inductor.test_aot_inductor_14.16_12ae829b1c880606_.log) 2024-08-20T23:19:45.9516133Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-a698ea308c44f213.xml 2024-08-20T23:19:45.9518375Z ============================= test session starts ============================== 2024-08-20T23:19:45.9519718Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.5.0 -- /opt/conda/envs/py_3.10/bin/python 2024-08-20T23:19:45.9520691Z cachedir: .pytest_cache 2024-08-20T23:19:45.9521846Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2024-08-20T23:19:45.9522924Z rootdir: /var/lib/jenkins/workspace 2024-08-20T23:19:45.9523326Z configfile: pytest.ini 2024-08-20T23:19:45.9524351Z plugins: hypothesis-5.35.1, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, xdist-3.3.1, xdoctest-1.1.0, typeguard-4.3.0 2024-08-20T23:19:45.9525559Z collecting ... collected 912 items 2024-08-20T23:19:45.9526217Z stepcurrent: Cannot find last run test, not skipping 2024-08-20T23:19:45.9577012Z Running 60 items in this shard: test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_bmm_multiple_dynamic_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_mmaped_weights_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misc_1_max_autotune_False_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misc_1_max_autotune_True_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_missing_cubin_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_multiple_output_alias_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_normal_functional_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_reuse_kernel_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_runtime_checks_shape_failed_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_small_constant_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_True_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_grid_with_unbacked_symbols_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_conv_freezing_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_deconv_freezing_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fft_c2c_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_normal_functional_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_output_path_1_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_False_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_view_outputs_abi_compatible_cpu_with_stack_allocation, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_assert_async_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_cond_non_tensor_predicates_dynamic_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_misc_1_max_autotune_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_normal_functional_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_pytree_inputs_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_return_constant_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_reuse_kernel_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_seq_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_equal_to_1_arg_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_cond_with_parameters_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_dup_unbacked_sym_decl_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_large_mmaped_weights_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_normal_functional_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_runtime_checks_complex_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_simple_split_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_True_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_while_loop_with_outer_buffers_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_with_profiler_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_addmm_multiple_dynamic_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_cond_nested_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_cond_with_reinterpret_view_inputs_outputs_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_convolution_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_large_weight_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_shifted_constraint_ranges_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_sympy_fn_like_arg_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_while_loop_nested_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_while_loop_simple_non_abi_compatible_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_index_put_fallback_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_output_misaligned_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_repeat_interleave_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_sdpa_2_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_simple_dynamic_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_small_constant_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_reinterpret_view_mem_leak_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_non_abi_compatible_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_with_none_input_non_abi_compatible_cuda 2024-08-20T23:19:45.9617835Z 2024-08-20T23:19:45.9619474Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_bmm_multiple_dynamic_abi_compatible_cpu <- test/inductor/test_torchinductor.py W0820 22:53:36.376000 45523 torch/_inductor/kernel/bmm.py:164] No choices for GEMM, using ATen backend as fallback 2024-08-20T23:19:45.9620988Z PASSED [7.2868s] [ 1%] 2024-08-20T23:19:45.9622248Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_mmaped_weights_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [9.3039s] [ 3%] 2024-08-20T23:19:45.9623994Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misc_1_max_autotune_False_abi_compatible_cpu PASSED [7.7718s] [ 5%] 2024-08-20T23:19:45.9625709Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misc_1_max_autotune_True_abi_compatible_cpu E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:45.9627701Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:45.9628435Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:45.9637395Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.so 2024-08-20T23:19:45.9645769Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:45.9646497Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:45.9648463Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9651613Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:45.9653628Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:45.9654698Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:45.9656723Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9659815Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:45.9661991Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9663149Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9665196Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:45.9668312Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:45.9671093Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9672713Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:19:45.9673645Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9674535Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:45.9676789Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:45.9679427Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:245:29: required from here 2024-08-20T23:19:45.9681744Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9683564Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9684780Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9685876Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:45.9688172Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:45.9690587Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:258:29: required from here 2024-08-20T23:19:45.9692801Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9694611Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9695824Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9696894Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:45.9698448Z E0820 22:54:02.061000 45523 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:45.9699972Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:45.9700881Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:45.9701676Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:45.9702397Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:45.9711504Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.so 2024-08-20T23:19:45.9719727Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:45.9720584Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:45.9722497Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9725588Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:45.9727605Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:45.9728665Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:45.9730696Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9733783Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:45.9735926Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9737075Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9739167Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:45.9741807Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:45.9744454Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9746078Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:19:45.9747010Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9747902Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:45.9750193Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:45.9752720Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:245:29: required from here 2024-08-20T23:19:45.9754947Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9756769Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9758003Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9759080Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:45.9761546Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:45.9763974Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:258:29: required from here 2024-08-20T23:19:45.9766203Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/k3/ck34vv6kqxnf2srtfhykaa2fpea3zfdh33emppkuyi5yzk7mjgzo.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9768378Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9769622Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9770685Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:45.9771534Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:45.9772329Z E0820 22:54:03.733000 45523 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:45.9773227Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:45.9774039Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:45.9774767Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:45.9783965Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.so 2024-08-20T23:19:45.9792247Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:45.9792968Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:45.9794893Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9798000Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:45.9800193Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:45.9801260Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:45.9803296Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9806381Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:45.9808572Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9809755Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9811805Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:45.9814447Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:45.9817132Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9818777Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:45.9819706Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9820598Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:45.9822843Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:45.9825381Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:201:29: required from here 2024-08-20T23:19:45.9827626Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9829451Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9830677Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9831751Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:45.9834075Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:45.9836505Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:214:29: required from here 2024-08-20T23:19:45.9838794Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9840707Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9841922Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9843000Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:45.9844544Z E0820 22:54:05.358000 45523 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:45.9846076Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:45.9846979Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:45.9847744Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:45.9848477Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:45.9857542Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.so 2024-08-20T23:19:45.9865778Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:45.9866504Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:45.9868765Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9871895Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:45.9873909Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:45.9874958Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:45.9876993Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9880150Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:45.9882306Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9883452Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9885516Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:45.9888159Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:45.9890897Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9892538Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:45.9893469Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9894358Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:45.9896611Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:45.9899192Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:201:29: required from here 2024-08-20T23:19:45.9901435Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9903249Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9904463Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9905532Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:45.9907849Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:45.9910323Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:214:29: required from here 2024-08-20T23:19:45.9912554Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ww/cwwuhjwsgf4frwiavggwktje4zoh45biqpgdffck7weejlqq3v4u.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9914369Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9915565Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9916636Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:45.9917487Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:45.9918267Z E0820 22:54:07.031000 45523 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:45.9919211Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:45.9920104Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:45.9920815Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:45.9929852Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.so 2024-08-20T23:19:45.9938082Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:45.9938796Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:45.9940708Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9943789Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:45.9945819Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:45.9946879Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:45.9948947Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:45.9952043Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:45.9954223Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9955381Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9957426Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:45.9960191Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:45.9962687Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9964504Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:45.9965448Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9966338Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:45.9969020Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:45.9971722Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:201:29: required from here 2024-08-20T23:19:45.9973986Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9975809Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9977020Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9978121Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:45.9980471Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:45.9982901Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:214:29: required from here 2024-08-20T23:19:45.9985140Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:45.9986937Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:45.9988169Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:45.9989233Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:45.9990928Z E0820 22:54:08.651000 45523 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:45.9992458Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:45.9993377Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:45.9994149Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:45.9994958Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.0003937Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.so 2024-08-20T23:19:46.0012144Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0012856Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.0014757Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0017817Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0019827Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0020878Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0022880Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0025965Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0028251Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0029411Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0031459Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0034096Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0036697Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0038333Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0039255Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0040303Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0042550Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0045003Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:201:29: required from here 2024-08-20T23:19:46.0047253Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0049079Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0050293Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0051384Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0053696Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0056146Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:214:29: required from here 2024-08-20T23:19:46.0058386Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_kcuc315/ss/csswso3bsijnn7ibrcsfiqr4puv5fzvbgxpftsppg2iszpuyvqiz.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0060206Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0061429Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0062610Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0063468Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.0064251Z E0820 22:54:10.298000 45523 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.0064963Z PASSED [17.5775s] [ 6%] 2024-08-20T23:19:46.0066265Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_missing_cubin_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (Skipped!) [ 8%] 2024-08-20T23:19:46.0068977Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_multiple_output_alias_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 10%] 2024-08-20T23:19:46.0071288Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_normal_functional_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [6.5565s] [ 11%] 2024-08-20T23:19:46.0073161Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_reuse_kernel_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.3359s] [ 13%] 2024-08-20T23:19:46.0075241Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_runtime_checks_shape_failed_abi_compatible_cpu <- test/inductor/test_torchinductor.py Error: input_handles[0]: unmatched dim value at 1, expected: 4, but got: 8 2024-08-20T23:19:46.0076387Z 2024-08-20T23:19:46.0076701Z Error: input_handles[0]: unmatched stride value at 1, expected: 4, but got: 1 2024-08-20T23:19:46.0077164Z 2024-08-20T23:19:46.0077529Z Error: input_handles[0]: dim value is too large at 0, expected to be <= 1024, but got: 2048 2024-08-20T23:19:46.0078045Z 2024-08-20T23:19:46.0078399Z Error: input_handles[0]: dim value is too large at 0, expected to be <= 1024, but got: 2048 2024-08-20T23:19:46.0078904Z 2024-08-20T23:19:46.0079110Z PASSED [5.0572s] [ 15%] 2024-08-20T23:19:46.0080410Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_small_constant_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [6.5034s] [ 16%] 2024-08-20T23:19:46.0082368Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu SKIPPED [0.0050s] (requires CUDA) [ 18%] 2024-08-20T23:19:46.0084421Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_True_abi_compatible_cpu SKIPPED [0.0049s] (requires CUDA) [ 20%] 2024-08-20T23:19:46.0086489Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_abi_compatible_cpu SKIPPED [0.0049s] (requires CUDA) [ 21%] 2024-08-20T23:19:46.0088535Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_grid_with_unbacked_symbols_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.2553s] [ 23%] 2024-08-20T23:19:46.0090751Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_conv_freezing_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 25%] 2024-08-20T23:19:46.0093154Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_deconv_freezing_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 26%] 2024-08-20T23:19:46.0095538Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fft_c2c_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 28%] 2024-08-20T23:19:46.0098051Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_normal_functional_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [6.5738s] [ 30%] 2024-08-20T23:19:46.0100353Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_output_path_1_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [7.3450s] [ 31%] 2024-08-20T23:19:46.0102740Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_False_abi_compatible_cpu_with_stack_allocation SKIPPED [0.0033s] (requires CUDA) [ 33%] 2024-08-20T23:19:46.0106003Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_view_outputs_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py In file included from /tmp/tmplfkvofj_/cp3fsfnhnwhgllflivdge65xsgizipwlgnxka3zd2ls4g4vqwyt3/czqdtmx2g62ay7ih6uigfs7slsqrrzewylajt2fut67nuqyw4c7s.cpp:5: 2024-08-20T23:19:46.0109555Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/inductor/aoti_runtime/thread_local.h: In instantiation of ‘void torch::aot_inductor::ThreadLocalCachedOutputTensor >::copy_data_from(const torch::aot_inductor::ArrayRefTensor&) [with T = float]’: 2024-08-20T23:19:46.0111946Z /tmp/tmplfkvofj_/cp3fsfnhnwhgllflivdge65xsgizipwlgnxka3zd2ls4g4vqwyt3/czqdtmx2g62ay7ih6uigfs7slsqrrzewylajt2fut67nuqyw4c7s.cpp:552:44: required from here 2024-08-20T23:19:46.0114274Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/inductor/aoti_runtime/thread_local.h:53:19: warning: comparison of integer expressions of different signedness: ‘long unsigned int’ and ‘int64_t’ {aka ‘long int’} [-Wsign-compare] 2024-08-20T23:19:46.0115820Z 53 | if (t.numel() > capacity_) { 2024-08-20T23:19:46.0116297Z PASSED [7.2715s] [ 35%] 2024-08-20T23:19:46.0118173Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_assert_async_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (requires CUDA) [ 36%] 2024-08-20T23:19:46.0121292Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_cond_non_tensor_predicates_dynamic_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0002s] (Skipped!) [ 38%] 2024-08-20T23:19:46.0124135Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_misc_1_max_autotune_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:46.0125940Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0126670Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:46.0135715Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.so 2024-08-20T23:19:46.0143994Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0144718Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:46.0146651Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0149721Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0151749Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0152793Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0154844Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0157980Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0160227Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0161388Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0163450Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0166129Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0169000Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0170661Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0171603Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0172506Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0175086Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0177551Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:245:29: required from here 2024-08-20T23:19:46.0179817Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0181783Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0183000Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0184081Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0186399Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0188865Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:258:29: required from here 2024-08-20T23:19:46.0191137Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0192974Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0194185Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0195254Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0196781Z E0820 22:55:13.481000 45523 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:46.0198325Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:46.0199236Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:46.0200076Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0200798Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.0209753Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.so 2024-08-20T23:19:46.0218039Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0218768Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.0220691Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0223787Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0225831Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0226893Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0228988Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0232110Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0234291Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0235443Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0237518Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0240312Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0242845Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0244505Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0245442Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0246334Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0248685Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0251154Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:245:29: required from here 2024-08-20T23:19:46.0253410Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0255319Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0256551Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0257617Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0259933Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0262386Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:258:29: required from here 2024-08-20T23:19:46.0264655Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/i5/ci5ckxvao3wqiny3z2t5wzdfn2mqnuze6ymwvhkicx74v56ulhhc.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0266501Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0268129Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0269271Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0270115Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.0270916Z E0820 22:55:15.125000 45523 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.0271820Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:46.0272621Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0273349Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:46.0282540Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.so 2024-08-20T23:19:46.0291069Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0291958Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:46.0293925Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0297053Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0299135Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0300187Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0302251Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0305395Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0307583Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0308733Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0310825Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0313537Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0316098Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0317764Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0318741Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0319653Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0322131Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0324624Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:201:29: required from here 2024-08-20T23:19:46.0326907Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0328892Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0330111Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0331194Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0333519Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0335985Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:214:29: required from here 2024-08-20T23:19:46.0338278Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0340129Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0341351Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0342411Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0343950Z E0820 22:55:16.722000 45523 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:46.0345490Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:46.0346399Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:46.0347152Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0347872Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.0356900Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.so 2024-08-20T23:19:46.0365186Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0366005Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.0368550Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0371747Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0373779Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0374838Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0376897Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0380057Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0382231Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0383382Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0385465Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0388160Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0390698Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0392341Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0393261Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0394167Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0396699Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0399246Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:201:29: required from here 2024-08-20T23:19:46.0401614Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0403586Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0404802Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0405885Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0408223Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0410709Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:214:29: required from here 2024-08-20T23:19:46.0413006Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/rz/crz5poojlickqqjwmaypmkoqodnsz4bkt5gve7yrqxego55liluy.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0414849Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0416055Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0417119Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0417976Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.0418763Z E0820 22:55:18.379000 45523 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.0419670Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:46.0420486Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0421207Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:46.0430255Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.so 2024-08-20T23:19:46.0438432Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0439157Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:46.0441327Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0444429Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0446462Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0447517Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0449602Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0452722Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0454873Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0456027Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0458098Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0460775Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0463284Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0464917Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0465847Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0466745Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0469593Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0472061Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:201:29: required from here 2024-08-20T23:19:46.0474323Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0476148Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0477509Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0478585Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0480992Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0483439Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:214:29: required from here 2024-08-20T23:19:46.0485690Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0487511Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0488738Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0489820Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0491377Z E0820 22:55:20.019000 45523 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:46.0492924Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:46.0493835Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:46.0494607Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0495339Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.0504359Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.so 2024-08-20T23:19:46.0512516Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0513345Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.0515262Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0518417Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0520547Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0521612Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0523658Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0526764Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0528932Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0530088Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0532145Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0534820Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0537326Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0538975Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0539905Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0540811Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0543200Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0545669Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:201:29: required from here 2024-08-20T23:19:46.0547923Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0549805Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0551134Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0552215Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0554531Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0556993Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:214:29: required from here 2024-08-20T23:19:46.0559264Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] /tmp/tmp9iussgl8/hw/chw754egy34oum6wsh5sbtfa5ygifgaj26wi3eyky4gov77s7rkw.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0561188Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0562393Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0563461Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0564305Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.0565086Z E0820 22:55:21.684000 45523 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.0565818Z PASSED [17.4959s] [ 40%] 2024-08-20T23:19:46.0568033Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_normal_functional_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 41%] 2024-08-20T23:19:46.0571089Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_pytree_inputs_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 43%] 2024-08-20T23:19:46.0574126Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_return_constant_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 45%] 2024-08-20T23:19:46.0577159Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_reuse_kernel_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 46%] 2024-08-20T23:19:46.0580397Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_seq_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 48%] 2024-08-20T23:19:46.0583473Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_equal_to_1_arg_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0032s] (requires CUDA) [ 50%] 2024-08-20T23:19:46.0586618Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0035s] (requires CUDA) [ 51%] 2024-08-20T23:19:46.0589254Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_cond_with_parameters_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [10.2372s] [ 53%] 2024-08-20T23:19:46.0591201Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_dup_unbacked_sym_decl_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [8.1295s] [ 55%] 2024-08-20T23:19:46.0593135Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_large_mmaped_weights_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [10.0045s] [ 56%] 2024-08-20T23:19:46.0595115Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_normal_functional_abi_compatible_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 58%] 2024-08-20T23:19:46.0597102Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_runtime_checks_complex_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [5.7087s] [ 60%] 2024-08-20T23:19:46.0598814Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_simple_split_abi_compatible_cuda PASSED [7.9050s] [ 61%] 2024-08-20T23:19:46.0600586Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_True_abi_compatible_cuda PASSED [13.0266s] [ 63%] 2024-08-20T23:19:46.0602499Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_abi_compatible_cuda PASSED [8.7007s] [ 65%] 2024-08-20T23:19:46.0604420Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_abi_compatible_cuda PASSED [7.7802s] [ 66%] 2024-08-20T23:19:46.0606384Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_while_loop_with_outer_buffers_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [8.7587s] [ 68%] 2024-08-20T23:19:46.0608320Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_with_profiler_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [7.9078s] [ 70%] 2024-08-20T23:19:46.0610645Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_addmm_multiple_dynamic_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py W0820 22:56:57.570000 45523 torch/_inductor/kernel/mm.py:425] No choices for GEMM, using ATen backend as fallback 2024-08-20T23:19:46.0612175Z PASSED [15.1759s] [ 71%] 2024-08-20T23:19:46.0613436Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_cond_nested_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [18.0168s] [ 73%] 2024-08-20T23:19:46.0616229Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_cond_with_reinterpret_view_inputs_outputs_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.4482s] [ 75%] 2024-08-20T23:19:46.0618274Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_convolution_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [13.9977s] [ 76%] 2024-08-20T23:19:46.0620189Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_large_weight_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [22.3858s] [ 78%] 2024-08-20T23:19:46.0622169Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_shifted_constraint_ranges_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [14.8302s] [ 80%] 2024-08-20T23:19:46.0624440Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_sympy_fn_like_arg_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (requires CUDA) [ 81%] 2024-08-20T23:19:46.0626537Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_while_loop_nested_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.9725s] [ 83%] 2024-08-20T23:19:46.0628497Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_while_loop_simple_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.2827s] [ 85%] 2024-08-20T23:19:46.0630457Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_index_put_fallback_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.8476s] [ 86%] 2024-08-20T23:19:46.0632446Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_output_misaligned_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.6358s] [ 88%] 2024-08-20T23:19:46.0634434Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_repeat_interleave_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.0081s] [ 90%] 2024-08-20T23:19:46.0636364Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_sdpa_2_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.6180s] [ 91%] 2024-08-20T23:19:46.0638274Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_simple_dynamic_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.6556s] [ 93%] 2024-08-20T23:19:46.0640284Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_small_constant_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.4296s] [ 95%] 2024-08-20T23:19:46.0642154Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_reinterpret_view_mem_leak_non_abi_compatible_cuda PASSED [7.8751s] [ 96%] 2024-08-20T23:19:46.0644064Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_non_abi_compatible_cuda PASSED [16.1295s] [ 98%] 2024-08-20T23:19:46.0646110Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_with_none_input_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [16.2053s] [100%] 2024-08-20T23:19:46.0647146Z 2024-08-20T23:19:46.0648020Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-a698ea308c44f213.xml - 2024-08-20T23:19:46.0649442Z ================== 40 passed, 20 skipped in 466.20s (0:07:46) ================== 2024-08-20T23:19:46.0650158Z Got exit code -11 (SIGSEGV) 2024-08-20T23:19:46.0650513Z Retrying single test... 2024-08-20T23:19:46.0651537Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-4e28cf9369da40f3.xml 2024-08-20T23:19:46.0652595Z ============================= test session starts ============================== 2024-08-20T23:19:46.0653479Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.5.0 -- /opt/conda/envs/py_3.10/bin/python 2024-08-20T23:19:46.0654152Z cachedir: .pytest_cache 2024-08-20T23:19:46.0655011Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2024-08-20T23:19:46.0655835Z rootdir: /var/lib/jenkins/workspace 2024-08-20T23:19:46.0656222Z configfile: pytest.ini 2024-08-20T23:19:46.0657173Z plugins: hypothesis-5.35.1, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, xdist-3.3.1, xdoctest-1.1.0, typeguard-4.3.0 2024-08-20T23:19:46.0658057Z collecting ... collected 912 items 2024-08-20T23:19:46.0658560Z stepcurrent: Cannot find last run test, not skipping 2024-08-20T23:19:46.0659032Z Running 60 items in this shard 2024-08-20T23:19:46.0659282Z 2024-08-20T23:19:46.0660634Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_bmm_multiple_dynamic_abi_compatible_cpu <- test/inductor/test_torchinductor.py W0820 23:06:30.412000 49583 torch/_inductor/kernel/bmm.py:164] No choices for GEMM, using ATen backend as fallback 2024-08-20T23:19:46.0662130Z PASSED [7.3259s] [ 1%] 2024-08-20T23:19:46.0663360Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_mmaped_weights_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [9.1146s] [ 3%] 2024-08-20T23:19:46.0665097Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misc_1_max_autotune_False_abi_compatible_cpu PASSED [7.6168s] [ 5%] 2024-08-20T23:19:46.0666808Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misc_1_max_autotune_True_abi_compatible_cpu E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:46.0668369Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0669095Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:46.0678006Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.so 2024-08-20T23:19:46.0686257Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0686979Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:46.0689204Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0692321Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0694354Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0695683Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0697735Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0700866Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0703040Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0704195Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0706266Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0708996Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0711517Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0713163Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0714098Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0714984Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0717259Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0719731Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:245:29: required from here 2024-08-20T23:19:46.0722145Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0723996Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0725393Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0726468Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0728792Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0731265Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:258:29: required from here 2024-08-20T23:19:46.0733644Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0735502Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0736715Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0737780Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0739373Z E0820 23:06:55.757000 49583 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:46.0740905Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:46.0741819Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:46.0742589Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0743305Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.0752241Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.so 2024-08-20T23:19:46.0760599Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0761323Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.0763407Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0766526Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0768915Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0770234Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0772312Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0775429Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0777609Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0778772Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0780830Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0783512Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0786039Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0787691Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0788630Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0789513Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0791776Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0794250Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:245:29: required from here 2024-08-20T23:19:46.0796511Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0798338Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0799869Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0800949Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0803278Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0805842Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:258:29: required from here 2024-08-20T23:19:46.0808108Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/qp/cqpehwsdky6542eyphmip54gqp2wqp2fja5k7pjxmdhjovrh7mly.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0809993Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0811207Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0812276Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0813126Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.0813922Z E0820 23:06:57.418000 49583 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.0814830Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:46.0815639Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0816363Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:46.0825275Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.so 2024-08-20T23:19:46.0833463Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0834189Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:46.0836244Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0839380Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0841551Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0842600Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0844761Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0847907Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0850105Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0851248Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0853332Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0856032Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0858571Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0860238Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0861157Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0862063Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0864338Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0866821Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:201:29: required from here 2024-08-20T23:19:46.0869340Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0871177Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0872399Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0873715Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0876057Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0878532Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:214:29: required from here 2024-08-20T23:19:46.0881060Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0882914Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0884124Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0885186Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.0886803Z E0820 23:06:58.996000 49583 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:46.0888781Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:46.0889705Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:46.0890475Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0891207Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.0900135Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.so 2024-08-20T23:19:46.0908293Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.0909021Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.0911181Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0914310Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0916342Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0917399Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0919544Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0922806Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0924991Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0926146Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0928225Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.0930952Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.0933494Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0935152Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.0936081Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0936976Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0939250Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.0941740Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:201:29: required from here 2024-08-20T23:19:46.0944020Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0945877Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0947202Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0948292Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0950647Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.0953117Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:214:29: required from here 2024-08-20T23:19:46.0955502Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/w3/cw3zjba7tllt4xey2dlxdpawujdvpzhpsqybkiqzylilxbrfns5q.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.0957360Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0958603Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.0959691Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.0960612Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.0961411Z E0820 23:07:00.635000 49583 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.0962311Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:46.0963122Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0963848Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:46.0973005Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.so 2024-08-20T23:19:46.0981187Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.0981915Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:46.0983838Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0987199Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.0989246Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.0990306Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.0992491Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.0995631Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.0997810Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.0998963Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1001111Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.1003822Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.1006355Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1007999Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.1008922Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1009826Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1012095Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.1014555Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:201:29: required from here 2024-08-20T23:19:46.1016832Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1018724Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1019957Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1021157Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1023479Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.1025959Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:214:29: required from here 2024-08-20T23:19:46.1028328Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1030231Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1031440Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1032511Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1034054Z E0820 23:07:02.227000 49583 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:46.1035588Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:46.1036495Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:46.1037251Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.1037981Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.1046951Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.so 2024-08-20T23:19:46.1055121Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.1055847Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.1057756Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1061094Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.1063154Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.1064209Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.1066349Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1069733Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.1071909Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1073068Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1075136Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.1077822Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.1080403Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1082057Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.1082979Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1083882Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1086138Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.1088605Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:201:29: required from here 2024-08-20T23:19:46.1090878Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1092722Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1093929Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1095252Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1097584Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.1100056Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:214:29: required from here 2024-08-20T23:19:46.1102470Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmpnx4t5jnu/xr/cxrulbzk7o3grntnkm7tc6eyfzdjexg5telbyku7ovmzb5un3dlm.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1104308Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1105525Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1106600Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1107452Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.1108234Z E0820 23:07:03.844000 49583 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.1108958Z PASSED [17.2056s] [ 6%] 2024-08-20T23:19:46.1110249Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_missing_cubin_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 8%] 2024-08-20T23:19:46.1112269Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_multiple_output_alias_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 10%] 2024-08-20T23:19:46.1114217Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_normal_functional_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [6.3808s] [ 11%] 2024-08-20T23:19:46.1116075Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_reuse_kernel_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.2130s] [ 13%] 2024-08-20T23:19:46.1118150Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_runtime_checks_shape_failed_abi_compatible_cpu <- test/inductor/test_torchinductor.py Error: input_handles[0]: unmatched dim value at 1, expected: 4, but got: 8 2024-08-20T23:19:46.1119349Z 2024-08-20T23:19:46.1119665Z Error: input_handles[0]: unmatched stride value at 1, expected: 4, but got: 1 2024-08-20T23:19:46.1120239Z 2024-08-20T23:19:46.1120607Z Error: input_handles[0]: dim value is too large at 0, expected to be <= 1024, but got: 2048 2024-08-20T23:19:46.1121131Z 2024-08-20T23:19:46.1121489Z Error: input_handles[0]: dim value is too large at 0, expected to be <= 1024, but got: 2048 2024-08-20T23:19:46.1122006Z 2024-08-20T23:19:46.1122202Z PASSED [4.9796s] [ 15%] 2024-08-20T23:19:46.1123413Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_small_constant_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [6.4464s] [ 16%] 2024-08-20T23:19:46.1125375Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu SKIPPED [0.0033s] (requires CUDA) [ 18%] 2024-08-20T23:19:46.1127558Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_True_abi_compatible_cpu SKIPPED [0.0030s] (requires CUDA) [ 20%] 2024-08-20T23:19:46.1129629Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_abi_compatible_cpu SKIPPED [0.0030s] (requires CUDA) [ 21%] 2024-08-20T23:19:46.1131683Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_grid_with_unbacked_symbols_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [7.2689s] [ 23%] 2024-08-20T23:19:46.1133913Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_conv_freezing_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 25%] 2024-08-20T23:19:46.1136435Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_deconv_freezing_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 26%] 2024-08-20T23:19:46.1138827Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_fft_c2c_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 28%] 2024-08-20T23:19:46.1141162Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_normal_functional_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [6.3526s] [ 30%] 2024-08-20T23:19:46.1143468Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_output_path_1_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py PASSED [7.1679s] [ 31%] 2024-08-20T23:19:46.1145874Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_triton_kernel_grid_type_2_num_dims_1_dynamic_True_autotune_False_abi_compatible_cpu_with_stack_allocation SKIPPED [0.0033s] (requires CUDA) [ 33%] 2024-08-20T23:19:46.1148995Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocation::test_view_outputs_abi_compatible_cpu_with_stack_allocation <- test/inductor/test_torchinductor.py In file included from /tmp/tmpmjjarftl/cp3fsfnhnwhgllflivdge65xsgizipwlgnxka3zd2ls4g4vqwyt3/cvpf67ny46kvnin4lf4jljkyyojd6lvsnuanedro4yf34h45qqyo.cpp:5: 2024-08-20T23:19:46.1152532Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/inductor/aoti_runtime/thread_local.h: In instantiation of ‘void torch::aot_inductor::ThreadLocalCachedOutputTensor >::copy_data_from(const torch::aot_inductor::ArrayRefTensor&) [with T = float]’: 2024-08-20T23:19:46.1154936Z /tmp/tmpmjjarftl/cp3fsfnhnwhgllflivdge65xsgizipwlgnxka3zd2ls4g4vqwyt3/cvpf67ny46kvnin4lf4jljkyyojd6lvsnuanedro4yf34h45qqyo.cpp:552:44: required from here 2024-08-20T23:19:46.1157295Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/inductor/aoti_runtime/thread_local.h:53:19: warning: comparison of integer expressions of different signedness: ‘long unsigned int’ and ‘int64_t’ {aka ‘long int’} [-Wsign-compare] 2024-08-20T23:19:46.1158815Z 53 | if (t.numel() > capacity_) { 2024-08-20T23:19:46.1159290Z PASSED [7.1162s] [ 35%] 2024-08-20T23:19:46.1161214Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_assert_async_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (requires CUDA) [ 36%] 2024-08-20T23:19:46.1164199Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_cond_non_tensor_predicates_dynamic_False_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0002s] (Skipped!) [ 38%] 2024-08-20T23:19:46.1167124Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_misc_1_max_autotune_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:46.1169263Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.1169986Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:46.1178864Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.so 2024-08-20T23:19:46.1187211Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.1187929Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:46.1189826Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1192829Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.1194834Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.1195887Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.1197899Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1201058Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.1203209Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1204368Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1206533Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.1209202Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.1211661Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1213363Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.1214294Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1215184Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1217410Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.1219813Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:245:29: required from here 2024-08-20T23:19:46.1222026Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1223844Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1225061Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1226138Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1228432Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.1230852Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:258:29: required from here 2024-08-20T23:19:46.1233061Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1234868Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1236090Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1237184Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1238832Z E0820 23:08:05.886000 49583 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:46.1240452Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:46.1241373Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:46.1242146Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.1242866Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.1251772Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.so 2024-08-20T23:19:46.1259985Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.1260706Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.1262594Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1265622Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.1267934Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.1269005Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.1271012Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1274065Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:176:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.1276214Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1277662Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1279708Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:176:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.1282370Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.1284991Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:208:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1286615Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] 208 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.1287559Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1288448Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1290677Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.1293101Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:245:29: required from here 2024-08-20T23:19:46.1295322Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1297123Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1298342Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1299415Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1301706Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.1304117Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:258:29: required from here 2024-08-20T23:19:46.1306307Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/r6/cr6cq7d6xg5f567w2n3h6mlofcgefu4vqtfvcxw6diga3jx7yjze.cpp:176:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1308105Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] 176 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1309380Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1310464Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1311312Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.1312200Z E0820 23:08:07.527000 49583 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.1313105Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:46.1313910Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.1314625Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:46.1323571Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.so 2024-08-20T23:19:46.1331811Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.1332537Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:46.1334418Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1337467Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.1339497Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.1340556Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.1342575Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1345639Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.1347808Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1348963Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1351106Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.1353724Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.1356189Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1357902Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.1358823Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1359723Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1362049Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.1364472Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:201:29: required from here 2024-08-20T23:19:46.1366684Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1368829Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1370054Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1371125Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1373426Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.1375833Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:214:29: required from here 2024-08-20T23:19:46.1378050Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1379857Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1381075Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1382159Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1383697Z E0820 23:08:09.066000 49583 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:46.1385462Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:46.1386391Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:46.1387158Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.1387888Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.1396725Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.so 2024-08-20T23:19:46.1405056Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.1405787Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.1407682Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1410723Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.1412725Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.1413779Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.1415801Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1418878Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.1421027Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1422178Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1424311Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.1426925Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.1429390Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1431097Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.1432025Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1432914Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1435137Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.1437542Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:201:29: required from here 2024-08-20T23:19:46.1439876Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1441691Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1442902Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1443976Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1446253Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.1448844Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:214:29: required from here 2024-08-20T23:19:46.1451049Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/f3/cf34jkzrn5kh7ckitguda6rm6xu43nen7rbvl62znadxn47igyo4.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1452841Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1454053Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1455209Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1456067Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.1456863Z E0820 23:08:10.688000 49583 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.1457778Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] Exception C++ compile error 2024-08-20T23:19:46.1458593Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.1459315Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] Command: 2024-08-20T23:19:46.1468549Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] g++ /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.so 2024-08-20T23:19:46.1476662Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] 2024-08-20T23:19:46.1477388Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] Output: 2024-08-20T23:19:46.1479291Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1482397Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.1484423Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.1485486Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.1487768Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1490851Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.1492993Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1494287Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1496342Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.1498963Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.1501437Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1503065Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.1503989Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1504887Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1507120Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.1509539Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:201:29: required from here 2024-08-20T23:19:46.1511775Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1513593Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1514805Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1515875Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1518156Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.1520706Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:214:29: required from here 2024-08-20T23:19:46.1523110Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1524928Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1526139Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1527201Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] | TORCH_CHECK 2024-08-20T23:19:46.1528842Z E0820 23:08:12.278000 49583 torch/_inductor/select_algorithm.py:1300] for benchmark choice DataProcessorChoiceCallerWrapper() 2024-08-20T23:19:46.1530373Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] Runtime error during autotuning: 2024-08-20T23:19:46.1531280Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] C++ compile error 2024-08-20T23:19:46.1532029Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.1532741Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] Command: 2024-08-20T23:19:46.1541584Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] g++ /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_AVX2 -shared -fPIC -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -march=native -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -fopenmp -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/include/python3.10 -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/TH -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THC -mavx2 -mfma -mf16c -D_GLIBCXX_USE_CXX11_ABI=1 -ltorch -ltorch_cpu -ltorch_python -lgomp -L/opt/conda/envs/py_3.10/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -o /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.so 2024-08-20T23:19:46.1549677Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] 2024-08-20T23:19:46.1550401Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] Output: 2024-08-20T23:19:46.1552297Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In function ‘void cpp_packed_gemm_micro_gemm_kernel(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1555351Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:19:11: warning: typedef ‘using VectorizedIn = class at::vec::CPU_CAPABILITY::Vectorized’ locally defined but not used [-Wunused-local-typedefs] 2024-08-20T23:19:46.1557379Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] 19 | using VectorizedIn = at::vec::Vectorized; 2024-08-20T23:19:46.1558435Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~ 2024-08-20T23:19:46.1560731Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In function ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t)’: 2024-08-20T23:19:46.1563821Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:132:21: error: there are no arguments to ‘AOTI_TORCH_CHECK’ that depend on a template parameter, so a declaration of ‘AOTI_TORCH_CHECK’ must be available [-fpermissive] 2024-08-20T23:19:46.1566074Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1567231Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1569642Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:132:21: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated) 2024-08-20T23:19:46.1572270Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In function ‘void cpp_packed_gemm(const float*, const float*, const float*, float*)’: 2024-08-20T23:19:46.1574756Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:164:5: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1576388Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] 164 | AOTI_TORCH_CHECK( 2024-08-20T23:19:46.1577314Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] | ^~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1578218Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1580459Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = false; int64_t = long int]’: 2024-08-20T23:19:46.1582881Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:201:29: required from here 2024-08-20T23:19:46.1585119Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1586954Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1588173Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1589286Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1591580Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp: In instantiation of ‘void cpp_packed_gemm_micro_gemm(const float*, const float*, float*, int64_t, int64_t, int64_t, int64_t, int64_t, int64_t) [with bool accum = true; int64_t = long int]’: 2024-08-20T23:19:46.1594025Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:214:29: required from here 2024-08-20T23:19:46.1596632Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] /tmp/tmp_yvk6pd2/rj/crjsvtmorsrjupmhdsezzl655gfa6vs5rnap3m6cozkbxxngfnku.cpp:132:37: error: ‘AOTI_TORCH_CHECK’ was not declared in this scope; did you mean ‘TORCH_CHECK’? 2024-08-20T23:19:46.1598460Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] 132 | AOTI_TORCH_CHECK(false, "Unsupported block_m: ", block_m); 2024-08-20T23:19:46.1599662Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] | ~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 2024-08-20T23:19:46.1600997Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] | TORCH_CHECK 2024-08-20T23:19:46.1601852Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] . 2024-08-20T23:19:46.1602641Z E0820 23:08:13.884000 49583 torch/_inductor/select_algorithm.py:1503] Ignoring this choice. 2024-08-20T23:19:46.1603362Z PASSED [17.0212s] [ 40%] 2024-08-20T23:19:46.1605256Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_normal_functional_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 41%] 2024-08-20T23:19:46.1608312Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_pytree_inputs_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 43%] 2024-08-20T23:19:46.1611351Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_return_constant_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 45%] 2024-08-20T23:19:46.1614380Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_reuse_kernel_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 46%] 2024-08-20T23:19:46.1617360Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_seq_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0001s] (Skipped!) [ 48%] 2024-08-20T23:19:46.1620485Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_equal_to_1_arg_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface <- test/inductor/test_torchinductor.py SKIPPED [0.0030s] (requires CUDA) [ 50%] 2024-08-20T23:19:46.1623640Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpuWithStackAllocationAndMinimalArrayRefInterface::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_abi_compatible_cpu_with_stack_allocation_and_minimal_arrayref_interface SKIPPED [0.0029s] (requires CUDA) [ 51%] 2024-08-20T23:19:46.1626124Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_cond_with_parameters_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [10.0973s] [ 53%] 2024-08-20T23:19:46.1628047Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_dup_unbacked_sym_decl_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [8.0223s] [ 55%] 2024-08-20T23:19:46.1630031Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_large_mmaped_weights_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [9.9673s] [ 56%] 2024-08-20T23:19:46.1632202Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_normal_functional_abi_compatible_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 58%] 2024-08-20T23:19:46.1634183Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_runtime_checks_complex_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [5.3501s] [ 60%] 2024-08-20T23:19:46.1635890Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_simple_split_abi_compatible_cuda PASSED [7.6023s] [ 61%] 2024-08-20T23:19:46.1637612Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_True_abi_compatible_cuda PASSED [12.9274s] [ 63%] 2024-08-20T23:19:46.1639642Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_abi_compatible_cuda PASSED [8.5947s] [ 65%] 2024-08-20T23:19:46.1641688Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_False_abi_compatible_cuda PASSED [7.7393s] [ 66%] 2024-08-20T23:19:46.1643645Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_while_loop_with_outer_buffers_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [8.7101s] [ 68%] 2024-08-20T23:19:46.1645575Z inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCuda::test_with_profiler_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [7.8943s] [ 70%] 2024-08-20T23:19:46.1647909Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_addmm_multiple_dynamic_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py W0820 23:09:48.244000 49583 torch/_inductor/kernel/mm.py:425] No choices for GEMM, using ATen backend as fallback 2024-08-20T23:19:46.1649438Z PASSED [15.0623s] [ 71%] 2024-08-20T23:19:46.1650693Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_cond_nested_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [17.7791s] [ 73%] 2024-08-20T23:19:46.1652743Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_cond_with_reinterpret_view_inputs_outputs_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.4744s] [ 75%] 2024-08-20T23:19:46.1654769Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_convolution_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [14.0842s] [ 76%] 2024-08-20T23:19:46.1656693Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_large_weight_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [22.4349s] [ 78%] 2024-08-20T23:19:46.1658721Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_shifted_constraint_ranges_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [14.9995s] [ 80%] 2024-08-20T23:19:46.1660860Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_triton_kernel_sympy_fn_like_arg_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py SKIPPED [0.0033s] (requires CUDA) [ 81%] 2024-08-20T23:19:46.1662951Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_while_loop_nested_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.9760s] [ 83%] 2024-08-20T23:19:46.1664910Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCpu::test_while_loop_simple_non_abi_compatible_cpu <- test/inductor/test_torchinductor.py PASSED [15.3748s] [ 85%] 2024-08-20T23:19:46.1666881Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_index_put_fallback_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.8425s] [ 86%] 2024-08-20T23:19:46.1669488Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_output_misaligned_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.6904s] [ 88%] 2024-08-20T23:19:46.1671484Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_repeat_interleave_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.0941s] [ 90%] 2024-08-20T23:19:46.1673411Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_sdpa_2_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.9001s] [ 91%] 2024-08-20T23:19:46.1675466Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_simple_dynamic_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.8124s] [ 93%] 2024-08-20T23:19:46.1677423Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_small_constant_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [15.4036s] [ 95%] 2024-08-20T23:19:46.1679286Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_reinterpret_view_mem_leak_non_abi_compatible_cuda PASSED [7.9016s] [ 96%] 2024-08-20T23:19:46.1681259Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_non_abi_compatible_cuda PASSED [16.1104s] [ 98%] 2024-08-20T23:19:46.1683313Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_with_none_input_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [16.2982s] [100%] 2024-08-20T23:19:46.1684361Z 2024-08-20T23:19:46.1685245Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-4e28cf9369da40f3.xml - 2024-08-20T23:19:46.1686659Z ================== 40 passed, 20 skipped in 463.56s (0:07:43) ================== 2024-08-20T23:19:46.1687384Z Got exit code -11 (SIGSEGV) 2024-08-20T23:19:46.1687744Z Retrying single test... 2024-08-20T23:19:46.1688675Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-f7cfbcc6dc95e22a.xml 2024-08-20T23:19:46.1689737Z ============================= test session starts ============================== 2024-08-20T23:19:46.1690631Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.5.0 -- /opt/conda/envs/py_3.10/bin/python 2024-08-20T23:19:46.1691324Z cachedir: .pytest_cache 2024-08-20T23:19:46.1692177Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2024-08-20T23:19:46.1693004Z rootdir: /var/lib/jenkins/workspace 2024-08-20T23:19:46.1693407Z configfile: pytest.ini 2024-08-20T23:19:46.1694265Z plugins: hypothesis-5.35.1, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, xdist-3.3.1, xdoctest-1.1.0, typeguard-4.3.0 2024-08-20T23:19:46.1695299Z collecting ... collected 912 items / 59 deselected / 853 selected 2024-08-20T23:19:46.1696589Z stepcurrent: skipping 59 already run items. Running only test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_with_none_input_non_abi_compatible_cuda 2024-08-20T23:19:46.1697715Z Running 1 items in this shard 2024-08-20T23:19:46.1697966Z 2024-08-20T23:19:46.1699019Z inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_with_none_input_non_abi_compatible_cuda <- test/inductor/test_torchinductor.py PASSED [17.2791s] [100%] 2024-08-20T23:19:46.1700065Z 2024-08-20T23:19:46.1700941Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-f7cfbcc6dc95e22a.xml - 2024-08-20T23:19:46.1702488Z ====================== 1 passed, 59 deselected in 17.36s ======================= 2024-08-20T23:19:46.1703147Z Got exit code 0 2024-08-20T23:19:46.1703619Z Test succeeeded in new process, continuing with the rest of the tests 2024-08-20T23:19:46.1704769Z Test results will be stored in test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-92326a8789d4e577.xml 2024-08-20T23:19:46.1705819Z ============================= test session starts ============================== 2024-08-20T23:19:46.1706709Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.5.0 -- /opt/conda/envs/py_3.10/bin/python 2024-08-20T23:19:46.1707536Z cachedir: .pytest_cache 2024-08-20T23:19:46.1708395Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2024-08-20T23:19:46.1709220Z rootdir: /var/lib/jenkins/workspace 2024-08-20T23:19:46.1709620Z configfile: pytest.ini 2024-08-20T23:19:46.1710483Z plugins: hypothesis-5.35.1, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, xdist-3.3.1, xdoctest-1.1.0, typeguard-4.3.0 2024-08-20T23:19:46.1711512Z collecting ... collected 912 items / 60 deselected / 852 selected 2024-08-20T23:19:46.1712110Z stepcurrent: skipping 60 already run items. 2024-08-20T23:19:46.1712556Z Running 0 items in this shard 2024-08-20T23:19:46.1712802Z 2024-08-20T23:19:46.1713687Z - generated xml file: /var/lib/jenkins/workspace/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-92326a8789d4e577.xml - 2024-08-20T23:19:46.1715019Z ============================ 60 deselected in 0.07s ============================ 2024-08-20T23:19:46.1716712Z The following tests failed and then succeeded when run in a new process['ul', 'test/inductor/test_aot_inductor.py::AOTInductorTestNonABICompatibleCuda::test_triton_kernel_with_none_input_non_abi_compatible_cuda'] 2024-08-20T23:19:46.1717816Z 2024-08-20T23:19:46.1718544Z FINISHED PRINTING LOG FILE of inductor/test_aot_inductor 14/16 (test/test-reports/inductor.test_aot_inductor_14.16_12ae829b1c880606_.log) 2024-08-20T23:19:46.1719287Z 2024-08-20T23:19:46.6705796Z Running test batch 'tests to run' cost 3538.75 seconds 2024-08-20T23:19:47.1719600Z 2024-08-20T23:19:47.1720271Z real 59m3.161s 2024-08-20T23:19:47.1720709Z user 73m20.951s 2024-08-20T23:19:47.1721007Z sys 11m36.625s 2024-08-20T23:19:47.1721295Z + assert_git_not_dirty 2024-08-20T23:19:47.1721991Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *rocm* ]] 2024-08-20T23:19:47.1722610Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *xla* ]] 2024-08-20T23:19:47.1726943Z ++ git status --porcelain 2024-08-20T23:19:47.1727779Z ++ grep -v '?? third_party' 2024-08-20T23:19:49.5790591Z ++ true 2024-08-20T23:19:49.5791625Z + git_status= 2024-08-20T23:19:49.5792208Z + [[ -n '' ]] 2024-08-20T23:19:49.5792571Z + test_aten 2024-08-20T23:19:49.5793018Z + echo 'Running ATen tests with pytorch lib' 2024-08-20T23:19:49.5793552Z Running ATen tests with pytorch lib 2024-08-20T23:19:49.5794002Z + [[ -n '' ]] 2024-08-20T23:19:49.5794458Z + echo 'Running test with the build folder' 2024-08-20T23:19:49.5794911Z Running test with the build folder 2024-08-20T23:19:49.5795373Z + TEST_BASE_DIR=build/bin 2024-08-20T23:19:49.5797173Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libc10.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libc10d_cuda_test.so build/bin 2024-08-20T23:19:49.5828808Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libcaffe2_nvrtc.so build/bin 2024-08-20T23:19:49.5842870Z + ln -sf '/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libmkldnn*' build/bin 2024-08-20T23:19:49.5857445Z + ln -sf '/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libnccl*' build/bin 2024-08-20T23:19:49.5875879Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_cuda_linalg.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_global_deps.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_python.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorchbind_test.so build/bin 2024-08-20T23:19:49.5887425Z + ls build/bin 2024-08-20T23:19:49.5928282Z BackoffTest 2024-08-20T23:19:49.5928737Z CppSignature_test 2024-08-20T23:19:49.5929417Z Dict_test 2024-08-20T23:19:49.5929777Z Dimname_test 2024-08-20T23:19:49.5930206Z FileStoreTest 2024-08-20T23:19:49.5930574Z HashStoreTest 2024-08-20T23:19:49.5930955Z IListRef_test 2024-08-20T23:19:49.5931344Z KernelFunction_test 2024-08-20T23:19:49.5931742Z List_test 2024-08-20T23:19:49.5932097Z MaybeOwned_test 2024-08-20T23:19:49.5932484Z NamedTensor_test 2024-08-20T23:19:49.5932891Z ProcessGroupGlooAsyncTest 2024-08-20T23:19:49.5933358Z ProcessGroupGlooTest 2024-08-20T23:19:49.5933784Z ProcessGroupMPITest 2024-08-20T23:19:49.5934218Z ProcessGroupNCCLErrorsTest 2024-08-20T23:19:49.5934722Z ProcessGroupNCCLTest 2024-08-20T23:19:49.5935139Z StorageUtils_test 2024-08-20T23:19:49.5935528Z TCPStoreTest 2024-08-20T23:19:49.5935916Z aot_model_compiler_test 2024-08-20T23:19:49.5936344Z apply_utils_test 2024-08-20T23:19:49.5936706Z atest 2024-08-20T23:19:49.5937072Z backend_fallback_test 2024-08-20T23:19:49.5937472Z basic 2024-08-20T23:19:49.5937802Z broadcast_test 2024-08-20T23:19:49.5938193Z c10_Bitset_test 2024-08-20T23:19:49.5938627Z c10_CompileTimeFunctionPointer_test 2024-08-20T23:19:49.5939133Z c10_ConstexprCrc_test 2024-08-20T23:19:49.5939473Z c10_DeadlockDetection_test 2024-08-20T23:19:49.5939821Z c10_DeviceGuard_test 2024-08-20T23:19:49.5940127Z c10_Device_test 2024-08-20T23:19:49.5940432Z c10_DispatchKeySet_test 2024-08-20T23:19:49.5940758Z c10_Half_test 2024-08-20T23:19:49.5941055Z c10_InlineDeviceGuard_test 2024-08-20T23:19:49.5941410Z c10_InlineStreamGuard_test 2024-08-20T23:19:49.5941752Z c10_LeftRight_test 2024-08-20T23:19:49.5942067Z c10_Metaprogramming_test 2024-08-20T23:19:49.5942403Z c10_Scalar_test 2024-08-20T23:19:49.5942697Z c10_SizesAndStrides_test 2024-08-20T23:19:49.5943032Z c10_StreamGuard_test 2024-08-20T23:19:49.5943342Z c10_SymInt_test 2024-08-20T23:19:49.5943628Z c10_Synchronized_test 2024-08-20T23:19:49.5944018Z c10_ThreadLocal_test 2024-08-20T23:19:49.5944328Z c10_TypeIndex_test 2024-08-20T23:19:49.5944644Z c10_TypeList_test 2024-08-20T23:19:49.5944943Z c10_TypeTraits_test 2024-08-20T23:19:49.5945249Z c10_accumulate_test 2024-08-20T23:19:49.5945563Z c10_bfloat16_test 2024-08-20T23:19:49.5945864Z c10_bit_cast_test 2024-08-20T23:19:49.5946160Z c10_complex_math_test 2024-08-20T23:19:49.5946475Z c10_complex_test 2024-08-20T23:19:49.5946772Z c10_cow_test 2024-08-20T23:19:49.5947088Z c10_cuda_CUDAAssertionsTest_1_var_test 2024-08-20T23:19:49.5947548Z c10_cuda_CUDAAssertionsTest_catches_stream 2024-08-20T23:19:49.5948109Z c10_cuda_CUDAAssertionsTest_catches_thread_and_block_and_device 2024-08-20T23:19:49.5948666Z c10_cuda_CUDAAssertionsTest_from_2_processes 2024-08-20T23:19:49.5949246Z c10_cuda_CUDAAssertionsTest_multiple_writes_from_blocks_and_threads 2024-08-20T23:19:49.5949909Z c10_cuda_CUDAAssertionsTest_multiple_writes_from_multiple_blocks 2024-08-20T23:19:49.5950533Z c10_cuda_CUDAAssertionsTest_multiple_writes_from_same_block 2024-08-20T23:19:49.5951027Z c10_cuda_CUDATest 2024-08-20T23:19:49.5951339Z c10_exception_test 2024-08-20T23:19:49.5951634Z c10_flags_test 2024-08-20T23:19:49.5951927Z c10_generic_math_test 2024-08-20T23:19:49.5952266Z c10_intrusive_ptr_benchmark 2024-08-20T23:19:49.5952613Z c10_intrusive_ptr_test 2024-08-20T23:19:49.5952932Z c10_irange_test 2024-08-20T23:19:49.5953221Z c10_lazy_test 2024-08-20T23:19:49.5953499Z c10_logging_test 2024-08-20T23:19:49.5953794Z c10_optional_test 2024-08-20T23:19:49.5954220Z c10_ordered_preserving_dict_test 2024-08-20T23:19:49.5954597Z c10_registry_test 2024-08-20T23:19:49.5954904Z c10_small_vector_test 2024-08-20T23:19:49.5955221Z c10_ssize_test 2024-08-20T23:19:49.5955511Z c10_string_util_test 2024-08-20T23:19:49.5955833Z c10_string_view_test 2024-08-20T23:19:49.5956155Z c10_tempfile_test 2024-08-20T23:19:49.5956453Z c10_typeid_test 2024-08-20T23:19:49.5956754Z cpu_allocator_test 2024-08-20T23:19:49.5957060Z cpu_generator_test 2024-08-20T23:19:49.5957389Z cpu_profiling_allocator_test 2024-08-20T23:19:49.5957747Z cpu_rng_test 2024-08-20T23:19:49.5958143Z cuda_allocatorTraceTracker_test 2024-08-20T23:19:49.5958528Z cuda_allocator_test 2024-08-20T23:19:49.5958850Z cuda_apply_test 2024-08-20T23:19:49.5959149Z cuda_atomic_ops_test 2024-08-20T23:19:49.5959497Z cuda_caching_host_allocator_test 2024-08-20T23:19:49.5959978Z cuda_complex_math_test 2024-08-20T23:19:49.5960304Z cuda_complex_test 2024-08-20T23:19:49.5960605Z cuda_cub_test 2024-08-20T23:19:49.5960906Z cuda_cudnn_test 2024-08-20T23:19:49.5961203Z cuda_device_test 2024-08-20T23:19:49.5961519Z cuda_distributions_test 2024-08-20T23:19:49.5961867Z cuda_dlconvertor_test 2024-08-20T23:19:49.5962187Z cuda_generator_test 2024-08-20T23:19:49.5962503Z cuda_half_test 2024-08-20T23:19:49.5962811Z cuda_integer_divider_test 2024-08-20T23:19:49.5963150Z cuda_optional_test 2024-08-20T23:19:49.5963487Z cuda_packedtensoraccessor_test 2024-08-20T23:19:49.5963887Z cuda_reportMemoryUsage_test 2024-08-20T23:19:49.5964243Z cuda_stream_test 2024-08-20T23:19:49.5964553Z cuda_vectorized_test 2024-08-20T23:19:49.5964895Z dispatch_key_set_test 2024-08-20T23:19:49.5965212Z dlconvertor_test 2024-08-20T23:19:49.5965520Z example_allreduce 2024-08-20T23:19:49.5965834Z extension_backend_test 2024-08-20T23:19:49.5966152Z half_test 2024-08-20T23:19:49.5966437Z inline_container_test 2024-08-20T23:19:49.5966756Z ivalue_test 2024-08-20T23:19:49.5967056Z kernel_function_legacy_test 2024-08-20T23:19:49.5967420Z kernel_function_test 2024-08-20T23:19:49.5967992Z kernel_lambda_legacy_test 2024-08-20T23:19:49.5968453Z kernel_lambda_test 2024-08-20T23:19:49.5968863Z kernel_stackbased_test 2024-08-20T23:19:49.5969286Z lazy_tensor_test 2024-08-20T23:19:49.5969611Z legacy_vmap_test 2024-08-20T23:19:49.5969905Z libc10.so 2024-08-20T23:19:49.5970167Z libc10_cuda.so 2024-08-20T23:19:49.5970461Z libc10d_cuda_test.so 2024-08-20T23:19:49.5970772Z libcaffe2_nvrtc.so 2024-08-20T23:19:49.5971145Z 'libmkldnn*' 2024-08-20T23:19:49.5971474Z 'libnccl*' 2024-08-20T23:19:49.5971840Z libtorch.so 2024-08-20T23:19:49.5972212Z libtorch_cpu.so 2024-08-20T23:19:49.5972604Z libtorch_cuda.so 2024-08-20T23:19:49.5972914Z libtorch_cuda_linalg.so 2024-08-20T23:19:49.5973245Z libtorch_global_deps.so 2024-08-20T23:19:49.5973583Z libtorch_python.so 2024-08-20T23:19:49.5973893Z libtorchbind_test.so 2024-08-20T23:19:49.5974232Z make_boxed_from_unboxed_functor_test 2024-08-20T23:19:49.5974623Z math_kernel_test 2024-08-20T23:19:49.5974936Z memory_format_test 2024-08-20T23:19:49.5975245Z memory_overlapping_test 2024-08-20T23:19:49.5975581Z mobile_memory_cleanup 2024-08-20T23:19:49.5975900Z native_test 2024-08-20T23:19:49.5976174Z op_allowlist_test 2024-08-20T23:19:49.5976487Z op_registration_test 2024-08-20T23:19:49.5976806Z operator_name_test 2024-08-20T23:19:49.5977104Z operators_test 2024-08-20T23:19:49.5977420Z packedtensoraccessor_test 2024-08-20T23:19:49.5977767Z parallel_benchmark 2024-08-20T23:19:49.5978060Z pow_test 2024-08-20T23:19:49.5978324Z protoc 2024-08-20T23:19:49.5978643Z protoc-3.13.0.0 2024-08-20T23:19:49.5978946Z quantized_test 2024-08-20T23:19:49.5979283Z reduce_ops_test 2024-08-20T23:19:49.5979584Z reportMemoryUsage_test 2024-08-20T23:19:49.5979916Z scalar_tensor_test 2024-08-20T23:19:49.5980216Z scalar_test 2024-08-20T23:19:49.5980502Z stride_properties_test 2024-08-20T23:19:49.5980837Z tensor_iterator_test 2024-08-20T23:19:49.5981144Z test_api 2024-08-20T23:19:49.5981406Z test_cpp_rpc 2024-08-20T23:19:49.5981853Z test_dist_autograd 2024-08-20T23:19:49.5982183Z test_edge_op_registration 2024-08-20T23:19:49.5982515Z test_jit 2024-08-20T23:19:49.5982780Z test_lazy 2024-08-20T23:19:49.5983053Z test_mobile_nnc 2024-08-20T23:19:49.5983346Z test_parallel 2024-08-20T23:19:49.5983647Z test_tensorexpr 2024-08-20T23:19:49.5983945Z thread_init_test 2024-08-20T23:19:49.5984240Z torch_shm_manager 2024-08-20T23:19:49.5984549Z tutorial_tensorexpr 2024-08-20T23:19:49.5984868Z type_ptr_test 2024-08-20T23:19:49.5985156Z type_test 2024-08-20T23:19:49.5985439Z undefined_tensor_test 2024-08-20T23:19:49.5985891Z vec_test_all_types_AVX2 2024-08-20T23:19:49.5986232Z vec_test_all_types_AVX512 2024-08-20T23:19:49.5986595Z vec_test_all_types_DEFAULT 2024-08-20T23:19:49.5987071Z verify_api_visibility 2024-08-20T23:19:49.5987502Z weakref_test 2024-08-20T23:19:49.5987866Z wrapdim_test 2024-08-20T23:19:49.5988160Z xla_tensor_test 2024-08-20T23:19:49.5988488Z + aten/tools/run_tests.sh build/bin 2024-08-20T23:19:49.5988923Z + set -e 2024-08-20T23:19:49.5989226Z ++ dirname aten/tools/run_tests.sh 2024-08-20T23:19:49.5989752Z + VALGRIND_SUP=/var/lib/jenkins/workspace/aten/tools/valgrind.sup 2024-08-20T23:19:49.5990304Z + export CPP_TESTS_DIR=build/bin 2024-08-20T23:19:49.5990693Z + CPP_TESTS_DIR=build/bin 2024-08-20T23:19:49.5991026Z + VALGRIND=ON 2024-08-20T23:19:49.5993357Z + python test/run_test.py --cpp --verbose -i cpp/basic cpp/atest cpp/scalar_test cpp/broadcast_test cpp/wrapdim_test cpp/apply_utils_test cpp/dlconvertor_test cpp/native_test cpp/scalar_tensor_test cpp/undefined_tensor_test cpp/extension_backend_test cpp/lazy_tensor_test cpp/tensor_iterator_test cpp/Dimname_test cpp/Dict_test cpp/NamedTensor_test cpp/cpu_generator_test cpp/legacy_vmap_test cpp/operators_test 2024-08-20T23:19:49.6970140Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:19:49.6971235Z import pkg_resources 2024-08-20T23:19:53.1849081Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:19:53.1957436Z Found test times from artifacts 2024-08-20T23:19:53.2392302Z Found test times from artifacts 2024-08-20T23:19:53.2408185Z Running 25% of tests based on TD 2024-08-20T23:19:53.2413715Z Running parallel tests on 3 processes 2024-08-20T23:19:53.2414341Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:19:53.2414878Z Serial tests (0): 2024-08-20T23:19:53.2415252Z Parallel tests (5): 2024-08-20T23:19:53.2415622Z cpp/Dict_test 1/1 2024-08-20T23:19:53.2416033Z cpp/Dimname_test 1/1 2024-08-20T23:19:53.2416393Z cpp/NamedTensor_test 1/1 2024-08-20T23:19:53.2416761Z cpp/apply_utils_test 1/1 2024-08-20T23:19:53.2417115Z cpp/atest 1/1 2024-08-20T23:19:53.2417442Z Name: excluded (est. time: 0.0min) 2024-08-20T23:19:53.2417830Z Serial tests (0): 2024-08-20T23:19:53.2418154Z Parallel tests (14): 2024-08-20T23:19:53.2418485Z cpp/basic 1/1 2024-08-20T23:19:53.2418796Z cpp/broadcast_test 1/1 2024-08-20T23:19:53.2419150Z cpp/cpu_generator_test 1/1 2024-08-20T23:19:53.2419527Z cpp/dlconvertor_test 1/1 2024-08-20T23:19:53.2419905Z cpp/extension_backend_test 1/1 2024-08-20T23:19:53.2420291Z cpp/lazy_tensor_test 1/1 2024-08-20T23:19:53.2420653Z cpp/legacy_vmap_test 1/1 2024-08-20T23:19:53.2421010Z cpp/native_test 1/1 2024-08-20T23:19:53.2421343Z cpp/operators_test 1/1 2024-08-20T23:19:53.2421703Z cpp/scalar_tensor_test 1/1 2024-08-20T23:19:53.2422078Z cpp/scalar_test 1/1 2024-08-20T23:19:53.2422419Z cpp/tensor_iterator_test 1/1 2024-08-20T23:19:53.2422819Z cpp/undefined_tensor_test 1/1 2024-08-20T23:19:53.2423206Z cpp/wrapdim_test 1/1 2024-08-20T23:19:53.2477609Z Running cpp/Dict_test 1/1 ... [2024-08-20 23:19:53.247299] 2024-08-20T23:19:53.2478150Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:19:53.2485204Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/Dict_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-b389bdf401504fe7.xml', '-x', '--reruns=2'] ... [2024-08-20 23:19:53.247932] 2024-08-20T23:19:55.3179282Z 2024-08-20T23:19:55.3180848Z cpp/Dict_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dict_test_1.1_aaaa37355bf8a051_.log 2024-08-20T23:19:55.3181719Z 2024-08-20T23:19:55.3182061Z Running cpp/Dimname_test 1/1 ... [2024-08-20 23:19:55.317517] 2024-08-20T23:19:55.3182666Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:19:55.3184709Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/Dimname_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-57adeefc1c3ba8ac.xml', '-x', '--reruns=2'] ... [2024-08-20 23:19:55.317933] 2024-08-20T23:19:57.2876072Z 2024-08-20T23:19:57.2877730Z cpp/Dimname_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dimname_test_1.1_a5340cd377bbd8ee_.log 2024-08-20T23:19:57.2878552Z 2024-08-20T23:19:57.2878913Z Running cpp/NamedTensor_test 1/1 ... [2024-08-20 23:19:57.287073] 2024-08-20T23:19:57.2879460Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:19:57.2881212Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/NamedTensor_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-d6912d9403b7428c.xml', '-x', '--reruns=2'] ... [2024-08-20 23:19:57.287473] 2024-08-20T23:19:59.2563300Z 2024-08-20T23:19:59.2564951Z cpp/NamedTensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.NamedTensor_test_1.1_684d5d0abb1da5d7_.log 2024-08-20T23:19:59.2565901Z 2024-08-20T23:19:59.2566315Z Running cpp/apply_utils_test 1/1 ... [2024-08-20 23:19:59.256051] 2024-08-20T23:19:59.2566937Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:19:59.2569598Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/apply_utils_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-09be58b340048525.xml', '-x', '--reruns=2'] ... [2024-08-20 23:19:59.256483] 2024-08-20T23:20:01.2255141Z 2024-08-20T23:20:01.2256993Z cpp/apply_utils_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.apply_utils_test_1.1_faaba58fe49bcd2f_.log 2024-08-20T23:20:01.2257942Z 2024-08-20T23:20:01.2258249Z Running cpp/atest 1/1 ... [2024-08-20 23:20:01.225209] 2024-08-20T23:20:01.2258848Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:01.2260629Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/atest', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-0b0a9ddb90faf8ba.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:01.225606] 2024-08-20T23:20:03.1946004Z 2024-08-20T23:20:03.1948041Z cpp/atest 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.atest_1.1_be6f3b64ed920491_.log 2024-08-20T23:20:03.1949256Z 2024-08-20T23:20:03.1949812Z Running cpp/Dict_test 1/1 ... [2024-08-20 23:20:03.194639] 2024-08-20T23:20:03.1950346Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:03.1952694Z Running cpp/Dimname_test 1/1 ... [2024-08-20 23:20:03.194927] 2024-08-20T23:20:03.1953251Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:03.1953867Z Running cpp/NamedTensor_test 1/1 ... [2024-08-20 23:20:03.194991] 2024-08-20T23:20:03.1954430Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:03.1956985Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/Dict_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-952351d32d807991.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:03.195303] 2024-08-20T23:20:03.1960135Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/Dimname_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-11d64b0790f641b8.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:03.195536] 2024-08-20T23:20:03.1962889Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/NamedTensor_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-c56e0c632a466fc5.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:03.195614] 2024-08-20T23:20:06.5188178Z 2024-08-20T23:20:06.5190199Z cpp/Dimname_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dimname_test_1.1_e1dcbccbbf55d0ff_.log 2024-08-20T23:20:06.5191349Z 2024-08-20T23:20:07.8218575Z 2024-08-20T23:20:07.8220543Z cpp/NamedTensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.NamedTensor_test_1.1_0609195c843e387d_.log 2024-08-20T23:20:07.8221674Z 2024-08-20T23:20:09.7369265Z Running cpp/apply_utils_test 1/1 ... [2024-08-20 23:20:09.736193] 2024-08-20T23:20:09.7370193Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:09.7372422Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/apply_utils_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-19e8311e51be88b7.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:09.736694] 2024-08-20T23:20:10.9438155Z Running cpp/atest 1/1 ... [2024-08-20 23:20:10.943191] 2024-08-20T23:20:10.9438983Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:10.9442758Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/atest', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-536077b20436c395.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:10.943781] 2024-08-20T23:20:13.3615588Z 2024-08-20T23:20:13.3617409Z cpp/apply_utils_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.apply_utils_test_1.1_6b8ea44231ad3dd4_.log 2024-08-20T23:20:13.3618760Z 2024-08-20T23:20:15.0929375Z 2024-08-20T23:20:15.0931557Z cpp/Dict_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dict_test_1.1_7533f43b4a5eca8b_.log 2024-08-20T23:20:15.0932903Z 2024-08-20T23:20:17.1235373Z 2024-08-20T23:20:17.1236903Z cpp/atest 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.atest_1.1_ddda09b994deaa3e_.log 2024-08-20T23:20:17.1237731Z 2024-08-20T23:20:18.7340294Z Running test batch 'tests to run' cost 25.49 seconds 2024-08-20T23:20:19.2409590Z + run_if_exists tensor_interop_test 2024-08-20T23:20:19.2410075Z + local test_name=tensor_interop_test 2024-08-20T23:20:19.2410754Z + [[ -x build/bin/tensor_interop_test ]] 2024-08-20T23:20:19.2411301Z + echo 'Warning: tensor_interop_test does not exist.' 2024-08-20T23:20:19.2413097Z Warning: tensor_interop_test does not exist. 2024-08-20T23:20:19.2413706Z + run_if_exists cudnn_test 2024-08-20T23:20:19.2414172Z + local test_name=cudnn_test 2024-08-20T23:20:19.2414645Z + [[ -x build/bin/cudnn_test ]] 2024-08-20T23:20:19.2415122Z + echo 'Warning: cudnn_test does not exist.' 2024-08-20T23:20:19.2415575Z Warning: cudnn_test does not exist. 2024-08-20T23:20:19.2415989Z + run_if_exists cuda_generator_test 2024-08-20T23:20:19.2416410Z + local test_name=cuda_generator_test 2024-08-20T23:20:19.2416876Z + [[ -x build/bin/cuda_generator_test ]] 2024-08-20T23:20:19.2417508Z + python test/run_test.py --cpp --verbose -i cpp/cuda_generator_test 2024-08-20T23:20:19.3384266Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:20:19.3385376Z import pkg_resources 2024-08-20T23:20:22.8091845Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:20:22.8195644Z Found test times from artifacts 2024-08-20T23:20:22.8623636Z Found test times from artifacts 2024-08-20T23:20:22.8636726Z Running 25% of tests based on TD 2024-08-20T23:20:22.8640789Z Running parallel tests on 3 processes 2024-08-20T23:20:22.8641245Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:20:22.8641741Z Serial tests (0): 2024-08-20T23:20:22.8642111Z Parallel tests (1): 2024-08-20T23:20:22.8642466Z cpp/cuda_generator_test 1/1 2024-08-20T23:20:22.8642870Z Name: excluded (est. time: 0.0min) 2024-08-20T23:20:22.8643484Z Serial tests (0): 2024-08-20T23:20:22.8643816Z Parallel tests (0): 2024-08-20T23:20:22.8698038Z Running cpp/cuda_generator_test 1/1 ... [2024-08-20 23:20:22.869378] 2024-08-20T23:20:22.8698636Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:22.8703327Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_generator_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-63261557a3d90cef.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:22.869866] 2024-08-20T23:20:24.9889700Z 2024-08-20T23:20:24.9891667Z cpp/cuda_generator_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_generator_test_1.1_6c55348e9bd6feb5_.log 2024-08-20T23:20:24.9893080Z 2024-08-20T23:20:25.3470531Z Running cpp/cuda_generator_test 1/1 ... [2024-08-20 23:20:25.346452] 2024-08-20T23:20:25.3471160Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:25.3474468Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_generator_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-7e15793039dac064.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:25.347005] 2024-08-20T23:20:30.6721613Z 2024-08-20T23:20:30.6724067Z cpp/cuda_generator_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_generator_test_1.1_d9de8d24cd176ac3_.log 2024-08-20T23:20:30.6725839Z 2024-08-20T23:20:31.2150669Z Running test batch 'tests to run' cost 8.35 seconds 2024-08-20T23:20:31.7169923Z + run_if_exists apply_test 2024-08-20T23:20:31.7170482Z + local test_name=apply_test 2024-08-20T23:20:31.7171249Z + [[ -x build/bin/apply_test ]] 2024-08-20T23:20:31.7171730Z + echo 'Warning: apply_test does not exist.' 2024-08-20T23:20:31.7172197Z Warning: apply_test does not exist. 2024-08-20T23:20:31.7172604Z + run_if_exists stream_test 2024-08-20T23:20:31.7172960Z + local test_name=stream_test 2024-08-20T23:20:31.7173484Z + [[ -x build/bin/stream_test ]] 2024-08-20T23:20:31.7174126Z + echo 'Warning: stream_test does not exist.' 2024-08-20T23:20:31.7174679Z Warning: stream_test does not exist. 2024-08-20T23:20:31.7175095Z + run_if_exists cuda_half_test 2024-08-20T23:20:31.7176526Z + local test_name=cuda_half_test 2024-08-20T23:20:31.7177183Z + [[ -x build/bin/cuda_half_test ]] 2024-08-20T23:20:31.7177852Z + python test/run_test.py --cpp --verbose -i cpp/cuda_half_test 2024-08-20T23:20:31.8148196Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:20:31.8149367Z import pkg_resources 2024-08-20T23:20:35.2819399Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:20:35.2924486Z Found test times from artifacts 2024-08-20T23:20:35.3352602Z Found test times from artifacts 2024-08-20T23:20:35.3367503Z Running 25% of tests based on TD 2024-08-20T23:20:35.3371058Z Running parallel tests on 3 processes 2024-08-20T23:20:35.3371641Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:20:35.3372175Z Serial tests (0): 2024-08-20T23:20:35.3372612Z Parallel tests (1): 2024-08-20T23:20:35.3373061Z cpp/cuda_half_test 1/1 2024-08-20T23:20:35.3373803Z Name: excluded (est. time: 0.0min) 2024-08-20T23:20:35.3374199Z Serial tests (0): 2024-08-20T23:20:35.3374522Z Parallel tests (0): 2024-08-20T23:20:35.3428326Z Running cpp/cuda_half_test 1/1 ... [2024-08-20 23:20:35.342425] 2024-08-20T23:20:35.3429052Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:35.3433955Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_half_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-387afc7e488d8aec.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:35.342951] 2024-08-20T23:20:37.4124358Z 2024-08-20T23:20:37.4125810Z cpp/cuda_half_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_half_test_1.1_502ab3190b60f479_.log 2024-08-20T23:20:37.4126667Z 2024-08-20T23:20:37.8455637Z Running cpp/cuda_half_test 1/1 ... [2024-08-20 23:20:37.844930] 2024-08-20T23:20:37.8456242Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:37.8458750Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_half_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-361a12dddefba5ae.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:37.845439] 2024-08-20T23:20:40.6161894Z 2024-08-20T23:20:40.6164116Z cpp/cuda_half_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_half_test_1.1_c7f7da4995c3237a_.log 2024-08-20T23:20:40.6165120Z 2024-08-20T23:20:41.1737740Z Running test batch 'tests to run' cost 5.84 seconds 2024-08-20T23:20:41.6936987Z + run_if_exists cuda_vectorized_test 2024-08-20T23:20:41.6937622Z + local test_name=cuda_vectorized_test 2024-08-20T23:20:41.6938348Z + [[ -x build/bin/cuda_vectorized_test ]] 2024-08-20T23:20:41.6939047Z + python test/run_test.py --cpp --verbose -i cpp/cuda_vectorized_test 2024-08-20T23:20:41.7917401Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:20:41.7918493Z import pkg_resources 2024-08-20T23:20:45.2969222Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:20:45.3072480Z Found test times from artifacts 2024-08-20T23:20:45.3501163Z Found test times from artifacts 2024-08-20T23:20:45.3516383Z Running 25% of tests based on TD 2024-08-20T23:20:45.3519614Z Running parallel tests on 3 processes 2024-08-20T23:20:45.3520255Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:20:45.3520821Z Serial tests (0): 2024-08-20T23:20:45.3521227Z Parallel tests (1): 2024-08-20T23:20:45.3521582Z cpp/cuda_vectorized_test 1/1 2024-08-20T23:20:45.3521995Z Name: excluded (est. time: 0.0min) 2024-08-20T23:20:45.3522395Z Serial tests (0): 2024-08-20T23:20:45.3522716Z Parallel tests (0): 2024-08-20T23:20:45.3577367Z Running cpp/cuda_vectorized_test 1/1 ... [2024-08-20 23:20:45.357312] 2024-08-20T23:20:45.3577987Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:45.3583242Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_vectorized_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-af8ac05b6fb9dc10.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:45.357838] 2024-08-20T23:20:47.4277589Z 2024-08-20T23:20:47.4279217Z cpp/cuda_vectorized_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_vectorized_test_1.1_262b4d2d0851d725_.log 2024-08-20T23:20:47.4280232Z 2024-08-20T23:20:47.8630023Z Running cpp/cuda_vectorized_test 1/1 ... [2024-08-20 23:20:47.862496] 2024-08-20T23:20:47.8630635Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:47.8634932Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_vectorized_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-b68d52babaad5f1a.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:47.863003] 2024-08-20T23:20:50.8335483Z 2024-08-20T23:20:50.8337192Z cpp/cuda_vectorized_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_vectorized_test_1.1_b03a1b339e1c3906_.log 2024-08-20T23:20:50.8338231Z 2024-08-20T23:20:51.3817431Z Running test batch 'tests to run' cost 6.03 seconds 2024-08-20T23:20:51.8801198Z + run_if_exists cuda_distributions_test 2024-08-20T23:20:51.8802164Z + local test_name=cuda_distributions_test 2024-08-20T23:20:51.8804389Z + [[ -x build/bin/cuda_distributions_test ]] 2024-08-20T23:20:51.8805096Z + python test/run_test.py --cpp --verbose -i cpp/cuda_distributions_test 2024-08-20T23:20:51.9775827Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:20:51.9777047Z import pkg_resources 2024-08-20T23:20:55.4801549Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:20:55.4903112Z Found test times from artifacts 2024-08-20T23:20:55.5334213Z Found test times from artifacts 2024-08-20T23:20:55.5349371Z Running 25% of tests based on TD 2024-08-20T23:20:55.5353210Z Running parallel tests on 3 processes 2024-08-20T23:20:55.5353749Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:20:55.5354215Z Serial tests (0): 2024-08-20T23:20:55.5354546Z Parallel tests (1): 2024-08-20T23:20:55.5354899Z cpp/cuda_distributions_test 1/1 2024-08-20T23:20:55.5355511Z Name: excluded (est. time: 0.0min) 2024-08-20T23:20:55.5355909Z Serial tests (0): 2024-08-20T23:20:55.5356220Z Parallel tests (0): 2024-08-20T23:20:55.5411455Z Running cpp/cuda_distributions_test 1/1 ... [2024-08-20 23:20:55.540736] 2024-08-20T23:20:55.5412071Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:55.5417240Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_distributions_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-0d094c0bc93ccf13.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:55.541262] 2024-08-20T23:20:57.6109470Z 2024-08-20T23:20:57.6111760Z cpp/cuda_distributions_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_distributions_test_1.1_b27957ce457b6ca5_.log 2024-08-20T23:20:57.6113094Z 2024-08-20T23:20:58.0346238Z Running cpp/cuda_distributions_test 1/1 ... [2024-08-20 23:20:58.034013] 2024-08-20T23:20:58.0346896Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:20:58.0349285Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_distributions_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-1fc376019fff7c7b.xml', '-x', '--reruns=2'] ... [2024-08-20 23:20:58.034545] 2024-08-20T23:21:01.9077757Z 2024-08-20T23:21:01.9079409Z cpp/cuda_distributions_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_distributions_test_1.1_3e05a59c6dcf4e62_.log 2024-08-20T23:21:01.9080418Z 2024-08-20T23:21:02.4554996Z Running test batch 'tests to run' cost 6.92 seconds 2024-08-20T23:21:02.9567407Z + run_if_exists cuda_optional_test 2024-08-20T23:21:02.9568210Z + local test_name=cuda_optional_test 2024-08-20T23:21:02.9568922Z + [[ -x build/bin/cuda_optional_test ]] 2024-08-20T23:21:02.9569563Z + python test/run_test.py --cpp --verbose -i cpp/cuda_optional_test 2024-08-20T23:21:03.0541483Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:21:03.0542578Z import pkg_resources 2024-08-20T23:21:06.5353985Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:21:06.5456335Z Found test times from artifacts 2024-08-20T23:21:06.5885503Z Found test times from artifacts 2024-08-20T23:21:06.5900294Z Running 25% of tests based on TD 2024-08-20T23:21:06.5904181Z Running parallel tests on 3 processes 2024-08-20T23:21:06.5904629Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:21:06.5905042Z Serial tests (0): 2024-08-20T23:21:06.5905383Z Parallel tests (1): 2024-08-20T23:21:06.5905720Z cpp/cuda_optional_test 1/1 2024-08-20T23:21:06.5906105Z Name: excluded (est. time: 0.0min) 2024-08-20T23:21:06.5906489Z Serial tests (0): 2024-08-20T23:21:06.5907021Z Parallel tests (0): 2024-08-20T23:21:06.5962330Z Running cpp/cuda_optional_test 1/1 ... [2024-08-20 23:21:06.595747] 2024-08-20T23:21:06.5962959Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:06.5967936Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_optional_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-e3248f2dc2d47e8d.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:06.596313] 2024-08-20T23:21:08.7159776Z 2024-08-20T23:21:08.7161255Z cpp/cuda_optional_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_optional_test_1.1_0f7550f7d889dd5e_.log 2024-08-20T23:21:08.7162129Z 2024-08-20T23:21:09.0467164Z Running cpp/cuda_optional_test 1/1 ... [2024-08-20 23:21:09.046171] 2024-08-20T23:21:09.0468045Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:09.0472768Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_optional_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-0fdf6b3387841554.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:09.046786] 2024-08-20T23:21:11.6166520Z 2024-08-20T23:21:11.6168372Z cpp/cuda_optional_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_optional_test_1.1_c67b9a9787d814ef_.log 2024-08-20T23:21:11.6169262Z 2024-08-20T23:21:12.1749866Z Running test batch 'tests to run' cost 5.58 seconds 2024-08-20T23:21:12.6878167Z + run_if_exists cuda_tensor_interop_test 2024-08-20T23:21:12.6878677Z + local test_name=cuda_tensor_interop_test 2024-08-20T23:21:12.6879433Z + [[ -x build/bin/cuda_tensor_interop_test ]] 2024-08-20T23:21:12.6880096Z + echo 'Warning: cuda_tensor_interop_test does not exist.' 2024-08-20T23:21:12.6880660Z Warning: cuda_tensor_interop_test does not exist. 2024-08-20T23:21:12.6881129Z + run_if_exists cuda_complex_test 2024-08-20T23:21:12.6881552Z + local test_name=cuda_complex_test 2024-08-20T23:21:12.6882012Z + [[ -x build/bin/cuda_complex_test ]] 2024-08-20T23:21:12.6882629Z + python test/run_test.py --cpp --verbose -i cpp/cuda_complex_test 2024-08-20T23:21:12.7852379Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:21:12.7853470Z import pkg_resources 2024-08-20T23:21:16.2956349Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:21:16.3061380Z Found test times from artifacts 2024-08-20T23:21:16.3488083Z Found test times from artifacts 2024-08-20T23:21:16.3502755Z Running 25% of tests based on TD 2024-08-20T23:21:16.3505899Z Running parallel tests on 3 processes 2024-08-20T23:21:16.3506355Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:21:16.3506883Z Serial tests (0): 2024-08-20T23:21:16.3507324Z Parallel tests (1): 2024-08-20T23:21:16.3507743Z cpp/cuda_complex_test 1/1 2024-08-20T23:21:16.3508256Z Name: excluded (est. time: 0.0min) 2024-08-20T23:21:16.3508650Z Serial tests (0): 2024-08-20T23:21:16.3508967Z Parallel tests (0): 2024-08-20T23:21:16.3562969Z Running cpp/cuda_complex_test 1/1 ... [2024-08-20 23:21:16.355893] 2024-08-20T23:21:16.3563552Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:16.3568770Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_complex_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-e8eac0802368c4b6.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:16.356390] 2024-08-20T23:21:18.4758438Z 2024-08-20T23:21:18.4759847Z cpp/cuda_complex_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_complex_test_1.1_fcbedcb022e59884_.log 2024-08-20T23:21:18.4760718Z 2024-08-20T23:21:18.8558651Z Running cpp/cuda_complex_test 1/1 ... [2024-08-20 23:21:18.855277] 2024-08-20T23:21:18.8559769Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:18.8563167Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_complex_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-38a71fd1a46dc97b.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:18.855810] 2024-08-20T23:21:23.5799234Z 2024-08-20T23:21:23.5800790Z cpp/cuda_complex_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_complex_test_1.1_4011fd5908d1ae23_.log 2024-08-20T23:21:23.5801666Z 2024-08-20T23:21:24.1335784Z Running test batch 'tests to run' cost 7.78 seconds 2024-08-20T23:21:24.6295466Z + run_if_exists cuda_complex_math_test 2024-08-20T23:21:24.6295993Z + local test_name=cuda_complex_math_test 2024-08-20T23:21:24.6296700Z + [[ -x build/bin/cuda_complex_math_test ]] 2024-08-20T23:21:24.6297378Z + python test/run_test.py --cpp --verbose -i cpp/cuda_complex_math_test 2024-08-20T23:21:24.7266376Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:21:24.7267450Z import pkg_resources 2024-08-20T23:21:28.2028829Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:21:28.2131127Z Found test times from artifacts 2024-08-20T23:21:28.2560888Z Found test times from artifacts 2024-08-20T23:21:28.2575900Z Running 25% of tests based on TD 2024-08-20T23:21:28.2579899Z Running parallel tests on 3 processes 2024-08-20T23:21:28.2580592Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:21:28.2581117Z Serial tests (0): 2024-08-20T23:21:28.2581537Z Parallel tests (1): 2024-08-20T23:21:28.2581983Z cpp/cuda_complex_math_test 1/1 2024-08-20T23:21:28.2582474Z Name: excluded (est. time: 0.0min) 2024-08-20T23:21:28.2582953Z Serial tests (0): 2024-08-20T23:21:28.2583610Z Parallel tests (0): 2024-08-20T23:21:28.2638027Z Running cpp/cuda_complex_math_test 1/1 ... [2024-08-20 23:21:28.263289] 2024-08-20T23:21:28.2638878Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:28.2643692Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_complex_math_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-5a8b5dddb6a622e8.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:28.263841] 2024-08-20T23:21:30.3837934Z 2024-08-20T23:21:30.3839675Z cpp/cuda_complex_math_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_complex_math_test_1.1_7c4c65ed65b01632_.log 2024-08-20T23:21:30.3840711Z 2024-08-20T23:21:30.7735891Z Running cpp/cuda_complex_math_test 1/1 ... [2024-08-20 23:21:30.773006] 2024-08-20T23:21:30.7736541Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:30.7739630Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_complex_math_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-19257ebb22991d5d.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:30.773531] 2024-08-20T23:21:41.4618981Z 2024-08-20T23:21:41.4621186Z cpp/cuda_complex_math_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_complex_math_test_1.1_68da292c4f34bcc6_.log 2024-08-20T23:21:41.4622265Z 2024-08-20T23:21:42.0128742Z Running test batch 'tests to run' cost 13.75 seconds 2024-08-20T23:21:42.5446840Z + run_if_exists cuda_cub_test 2024-08-20T23:21:42.5447419Z + local test_name=cuda_cub_test 2024-08-20T23:21:42.5448162Z + [[ -x build/bin/cuda_cub_test ]] 2024-08-20T23:21:42.5448758Z + python test/run_test.py --cpp --verbose -i cpp/cuda_cub_test 2024-08-20T23:21:42.6415298Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:21:42.6416829Z import pkg_resources 2024-08-20T23:21:46.1527002Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:21:46.1629818Z Found test times from artifacts 2024-08-20T23:21:46.2058238Z Found test times from artifacts 2024-08-20T23:21:46.2072896Z Running 25% of tests based on TD 2024-08-20T23:21:46.2075940Z Running parallel tests on 3 processes 2024-08-20T23:21:46.2076539Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:21:46.2077074Z Serial tests (0): 2024-08-20T23:21:46.2077491Z Parallel tests (1): 2024-08-20T23:21:46.2077854Z cpp/cuda_cub_test 1/1 2024-08-20T23:21:46.2078229Z Name: excluded (est. time: 0.0min) 2024-08-20T23:21:46.2078613Z Serial tests (0): 2024-08-20T23:21:46.2078937Z Parallel tests (0): 2024-08-20T23:21:46.2135095Z Running cpp/cuda_cub_test 1/1 ... [2024-08-20 23:21:46.212994] 2024-08-20T23:21:46.2135711Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:46.2140306Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_cub_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-c0808012ec4fa4f6.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:46.213498] 2024-08-20T23:21:48.3332585Z 2024-08-20T23:21:48.3334111Z cpp/cuda_cub_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_cub_test_1.1_b2bfe7cc369b9fc0_.log 2024-08-20T23:21:48.3334951Z 2024-08-20T23:21:48.7103105Z Running cpp/cuda_cub_test 1/1 ... [2024-08-20 23:21:48.709733] 2024-08-20T23:21:48.7103717Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:48.7107198Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_cub_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-39adbb35aef1c6b5.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:48.710248] 2024-08-20T23:21:51.6803008Z 2024-08-20T23:21:51.6804456Z cpp/cuda_cub_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_cub_test_1.1_653ea1cf7654f4e9_.log 2024-08-20T23:21:51.6805295Z 2024-08-20T23:21:52.2336179Z Running test batch 'tests to run' cost 6.03 seconds 2024-08-20T23:21:52.7376115Z + run_if_exists cuda_atomic_ops_test 2024-08-20T23:21:52.7376673Z + local test_name=cuda_atomic_ops_test 2024-08-20T23:21:52.7377311Z + [[ -x build/bin/cuda_atomic_ops_test ]] 2024-08-20T23:21:52.7377960Z + python test/run_test.py --cpp --verbose -i cpp/cuda_atomic_ops_test 2024-08-20T23:21:52.8344689Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:21:52.8345776Z import pkg_resources 2024-08-20T23:21:56.3449791Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:21:56.3554643Z Found test times from artifacts 2024-08-20T23:21:56.3989558Z Found test times from artifacts 2024-08-20T23:21:56.4002943Z Running 25% of tests based on TD 2024-08-20T23:21:56.4006459Z Running parallel tests on 3 processes 2024-08-20T23:21:56.4006958Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:21:56.4007856Z Serial tests (0): 2024-08-20T23:21:56.4008248Z Parallel tests (1): 2024-08-20T23:21:56.4008593Z cpp/cuda_atomic_ops_test 1/1 2024-08-20T23:21:56.4008991Z Name: excluded (est. time: 0.0min) 2024-08-20T23:21:56.4009373Z Serial tests (0): 2024-08-20T23:21:56.4010491Z Parallel tests (0): 2024-08-20T23:21:56.4069338Z Running cpp/cuda_atomic_ops_test 1/1 ... [2024-08-20 23:21:56.406399] 2024-08-20T23:21:56.4070032Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:56.4074161Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_atomic_ops_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-7506717cc1e6fe70.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:56.406956] 2024-08-20T23:21:58.5271075Z 2024-08-20T23:21:58.5273023Z cpp/cuda_atomic_ops_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_atomic_ops_test_1.1_30873c2559863a6c_.log 2024-08-20T23:21:58.5274114Z 2024-08-20T23:21:58.9314483Z Running cpp/cuda_atomic_ops_test 1/1 ... [2024-08-20 23:21:58.930857] 2024-08-20T23:21:58.9315112Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:21:58.9318642Z Executing ['pytest', '/var/lib/jenkins/workspace/build/bin/cuda_atomic_ops_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-93b5d3333d4af613.xml', '-x', '--reruns=2'] ... [2024-08-20 23:21:58.931436] 2024-08-20T23:22:02.8533150Z 2024-08-20T23:22:02.8534750Z cpp/cuda_atomic_ops_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cuda_atomic_ops_test_1.1_d5a336d04929c864_.log 2024-08-20T23:22:02.8535677Z 2024-08-20T23:22:03.4015385Z Running test batch 'tests to run' cost 7.0 seconds 2024-08-20T23:22:03.9047139Z + '[' ON == ON ']' 2024-08-20T23:22:03.9048265Z + valgrind --suppressions=/var/lib/jenkins/workspace/aten/tools/valgrind.sup --error-exitcode=1 build/bin/basic '--gtest_filter=-*CUDA' 2024-08-20T23:22:03.9167431Z ==54938== Memcheck, a memory error detector 2024-08-20T23:22:03.9168784Z ==54938== Copyright (C) 2002-2022, and GNU GPL'd, by Julian Seward et al. 2024-08-20T23:22:03.9169703Z ==54938== Using Valgrind-3.20.0 and LibVEX; rerun with -h for copyright info 2024-08-20T23:22:03.9170430Z ==54938== Command: build/bin/basic --gtest_filter=-*CUDA 2024-08-20T23:22:03.9170901Z ==54938== 2024-08-20T23:22:09.4306717Z ==54938== Warning: set address range perms: large range [0x2997b000, 0x3a7c4000) (defined) 2024-08-20T23:22:09.4870550Z ==54938== Warning: set address range perms: large range [0x3a7c4000, 0x4c3c1000) (defined) 2024-08-20T23:22:09.5333900Z ==54938== Warning: set address range perms: large range [0x3b374000, 0x4c116000) (defined) 2024-08-20T23:22:09.7291829Z ==54938== Warning: set address range perms: large range [0x60772000, 0x7e609000) (defined) 2024-08-20T23:23:00.5465247Z Running main() from /var/lib/jenkins/workspace/third_party/googletest/googletest/src/gtest_main.cc 2024-08-20T23:23:00.5694467Z Note: Google Test filter = -*CUDA 2024-08-20T23:23:00.5752470Z [==========] Running 4 tests from 1 test suite. 2024-08-20T23:23:00.5768250Z [----------] Global test environment set-up. 2024-08-20T23:23:00.5797585Z [----------] 4 tests from BasicTest 2024-08-20T23:23:00.5817219Z [ RUN ] BasicTest.BasicTestCPU 2024-08-20T23:23:00.8613945Z ==54938== Warning: noted but unhandled ioctl 0x30000001 with no size/direction hints. 2024-08-20T23:23:00.8614687Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:00.8617535Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:00.8619954Z ==54938== Warning: noted but unhandled ioctl 0x4b with no size/direction hints. 2024-08-20T23:23:00.8620648Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:00.8621380Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:00.8630363Z ==54938== Warning: noted but unhandled ioctl 0x27 with no size/direction hints. 2024-08-20T23:23:00.8631079Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:00.8631799Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:01.0350093Z ==54938== Warning: noted but unhandled ioctl 0x25 with no size/direction hints. 2024-08-20T23:23:01.0350824Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:01.0351550Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:01.1361162Z ==54938== Warning: noted but unhandled ioctl 0x17 with no size/direction hints. 2024-08-20T23:23:01.1361887Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:01.1362619Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:01.1474512Z ==54938== Warning: set address range perms: large range [0x200000000, 0x300200000) (noaccess) 2024-08-20T23:23:01.1586463Z ==54938== Warning: set address range perms: large range [0x88d9d000, 0xa8d9c000) (noaccess) 2024-08-20T23:23:03.2219556Z 1142 ms 2024-08-20T23:23:03.3675580Z 50 ms 2024-08-20T23:23:03.4487183Z 73 ms 2024-08-20T23:23:05.8234323Z [ OK ] BasicTest.BasicTestCPU (5239 ms) 2024-08-20T23:23:05.8677644Z [ RUN ] BasicTest.BasicTestHalfCPU 2024-08-20T23:23:06.3774874Z 407 ms 2024-08-20T23:23:06.4212601Z 38 ms 2024-08-20T23:23:06.5036969Z 68 ms 2024-08-20T23:23:06.5797483Z [ OK ] BasicTest.BasicTestHalfCPU (652 ms) 2024-08-20T23:23:06.5798623Z [ RUN ] BasicTest.FactoryMethodsTest 2024-08-20T23:23:06.7059939Z ==54938== Warning: noted but unhandled ioctl 0x19 with no size/direction hints. 2024-08-20T23:23:06.7061358Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:06.7062114Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:06.8282715Z ==54938== Warning: noted but unhandled ioctl 0x49 with no size/direction hints. 2024-08-20T23:23:06.8283562Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:06.8284314Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:07.0189810Z ==54938== Warning: noted but unhandled ioctl 0x21 with no size/direction hints. 2024-08-20T23:23:07.0190640Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:07.0191487Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:07.1842333Z ==54938== Warning: noted but unhandled ioctl 0x1b with no size/direction hints. 2024-08-20T23:23:07.1843091Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:07.1843821Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:07.2994825Z ==54938== Warning: noted but unhandled ioctl 0x44 with no size/direction hints. 2024-08-20T23:23:07.2996151Z ==54938== This could cause spurious value errors to appear. 2024-08-20T23:23:07.2997494Z ==54938== See README_MISSING_SYSCALL_OR_IOCTL for guidance on writing a proper wrapper. 2024-08-20T23:23:08.3031828Z [ OK ] BasicTest.FactoryMethodsTest (1723 ms) 2024-08-20T23:23:08.3032416Z [ RUN ] BasicTest.BasicStdTestCPU 2024-08-20T23:23:08.4243622Z Simple example: called once 2024-08-20T23:23:08.4357233Z Didn't throw, call_once will not attempt again 2024-08-20T23:23:08.4694064Z [ OK ] BasicTest.BasicStdTestCPU (166 ms) 2024-08-20T23:23:08.4715999Z [----------] 4 tests from BasicTest (7888 ms total) 2024-08-20T23:23:08.4716344Z 2024-08-20T23:23:08.4724634Z [----------] Global test environment tear-down 2024-08-20T23:23:08.4751561Z [==========] 4 tests from 1 test suite ran. (7908 ms total) 2024-08-20T23:23:08.4764197Z [ PASSED ] 4 tests. 2024-08-20T23:23:11.3235289Z ==54938== 2024-08-20T23:23:11.3246577Z ==54938== HEAP SUMMARY: 2024-08-20T23:23:11.3247492Z ==54938== in use at exit: 17,140,002 bytes in 14,159 blocks 2024-08-20T23:23:11.3248214Z ==54938== total heap usage: 986,231 allocs, 972,072 frees, 283,887,736 bytes allocated 2024-08-20T23:23:11.3248812Z ==54938== 2024-08-20T23:23:12.6305745Z ==54938== LEAK SUMMARY: 2024-08-20T23:23:12.6306230Z ==54938== definitely lost: 288 bytes in 3 blocks 2024-08-20T23:23:12.6306776Z ==54938== indirectly lost: 192 bytes in 2 blocks 2024-08-20T23:23:12.6307313Z ==54938== possibly lost: 27,776 bytes in 191 blocks 2024-08-20T23:23:12.6307878Z ==54938== still reachable: 17,111,746 bytes in 13,963 blocks 2024-08-20T23:23:12.6308850Z ==54938== suppressed: 0 bytes in 0 blocks 2024-08-20T23:23:12.6309713Z ==54938== Rerun with --leak-check=full to see details of leaked memory 2024-08-20T23:23:12.6310255Z ==54938== 2024-08-20T23:23:12.6310744Z ==54938== For lists of detected and suppressed errors, rerun with: -s 2024-08-20T23:23:12.6311465Z ==54938== ERROR SUMMARY: 0 errors from 0 contexts (suppressed: 4 from 4) 2024-08-20T23:23:12.8755557Z + [[ -x build/bin/tensor_interop_test ]] 2024-08-20T23:23:12.8757174Z + [[ -n '' ]] 2024-08-20T23:23:12.8757491Z + assert_git_not_dirty 2024-08-20T23:23:12.8757995Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *rocm* ]] 2024-08-20T23:23:12.8758622Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *xla* ]] 2024-08-20T23:23:12.8766279Z ++ git status --porcelain 2024-08-20T23:23:12.8766992Z ++ grep -v '?? third_party' 2024-08-20T23:23:13.1154538Z ++ true 2024-08-20T23:23:13.1156935Z + git_status= 2024-08-20T23:23:13.1157667Z + [[ -n '' ]] 2024-08-20T23:23:13.1158075Z + test_libtorch 1 2024-08-20T23:23:13.1158411Z + local SHARD=1 2024-08-20T23:23:13.1158725Z + [[ default != \s\l\o\w ]] 2024-08-20T23:23:13.1159125Z + echo 'Testing libtorch' 2024-08-20T23:23:13.1159461Z Testing libtorch 2024-08-20T23:23:13.1160680Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libbackend_with_compiler.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T23:23:13.1174917Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libjitbackend_test.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T23:23:13.1189143Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libcaffe2_nvrtc.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T23:23:13.1205065Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libc10.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libc10d_cuda_test.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T23:23:13.1219556Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libshm.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T23:23:13.1237006Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_cuda_linalg.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_global_deps.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_python.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorchbind_test.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T23:23:13.1251380Z + ln -sf '/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libnvfuser*' /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T23:23:13.1265343Z + export CPP_TESTS_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T23:23:13.1266567Z + CPP_TESTS_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-08-20T23:23:13.1267256Z + [[ -z 1 ]] 2024-08-20T23:23:13.1267536Z + [[ 1 == \1 ]] 2024-08-20T23:23:13.1269460Z + test_libtorch_api 2024-08-20T23:23:13.1269903Z + MNIST_DIR=/var/lib/jenkins/workspace/test/cpp/api/mnist 2024-08-20T23:23:13.1270757Z + python tools/download_mnist.py --quiet -d /var/lib/jenkins/workspace/test/cpp/api/mnist 2024-08-20T23:23:13.1735473Z Downloading http://yann.lecun.com/exdb/mnist/train-images-idx3-ubyte.gz ... 2024-08-20T23:23:13.2608172Z Failed to download (trying next): 2024-08-20T23:23:13.2608624Z HTTP Error 403: Forbidden 2024-08-20T23:23:13.2613711Z Downloading https://ossci-datasets.s3.amazonaws.com/mnist/train-images-idx3-ubyte.gz ... 2024-08-20T23:23:13.6270533Z Downloading http://yann.lecun.com/exdb/mnist/train-labels-idx1-ubyte.gz ... 2024-08-20T23:23:13.7101709Z Failed to download (trying next): 2024-08-20T23:23:13.7102529Z HTTP Error 403: Forbidden 2024-08-20T23:23:13.7105873Z Downloading https://ossci-datasets.s3.amazonaws.com/mnist/train-labels-idx1-ubyte.gz ... 2024-08-20T23:23:13.7421072Z Downloading http://yann.lecun.com/exdb/mnist/t10k-images-idx3-ubyte.gz ... 2024-08-20T23:23:13.8197843Z Failed to download (trying next): 2024-08-20T23:23:13.8198270Z HTTP Error 403: Forbidden 2024-08-20T23:23:13.8202714Z Downloading https://ossci-datasets.s3.amazonaws.com/mnist/t10k-images-idx3-ubyte.gz ... 2024-08-20T23:23:13.9102742Z Downloading http://yann.lecun.com/exdb/mnist/t10k-labels-idx1-ubyte.gz ... 2024-08-20T23:23:13.9898182Z Failed to download (trying next): 2024-08-20T23:23:13.9898599Z HTTP Error 403: Forbidden 2024-08-20T23:23:13.9902759Z Downloading https://ossci-datasets.s3.amazonaws.com/mnist/t10k-labels-idx1-ubyte.gz ... 2024-08-20T23:23:14.0374721Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *asan* ]] 2024-08-20T23:23:14.0375522Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *slow-gradcheck* ]] 2024-08-20T23:23:14.0376097Z + OMP_NUM_THREADS=2 2024-08-20T23:23:14.0376587Z + TORCH_CPP_TEST_MNIST_PATH=/var/lib/jenkins/workspace/test/cpp/api/mnist 2024-08-20T23:23:14.0377443Z + python test/run_test.py --cpp --verbose -i cpp/test_api -k 'not IMethodTest' 2024-08-20T23:23:14.1351402Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:23:14.1353559Z import pkg_resources 2024-08-20T23:23:17.6095833Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:23:17.6196844Z Found test times from artifacts 2024-08-20T23:23:17.6627057Z Found test times from artifacts 2024-08-20T23:23:17.6641350Z Running 25% of tests based on TD 2024-08-20T23:23:17.6644523Z Running parallel tests on 3 processes 2024-08-20T23:23:17.6645148Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:23:17.6645667Z Serial tests (0): 2024-08-20T23:23:17.6646086Z Parallel tests (1): 2024-08-20T23:23:17.6646422Z cpp/test_api 1/1 2024-08-20T23:23:17.6646756Z Name: excluded (est. time: 0.0min) 2024-08-20T23:23:17.6647139Z Serial tests (0): 2024-08-20T23:23:17.6649608Z Parallel tests (0): 2024-08-20T23:23:17.6706667Z Running cpp/test_api 1/1 ... [2024-08-20 23:23:17.670245] 2024-08-20T23:23:17.6707371Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:23:17.6712576Z Executing ['pytest', '/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/test_api', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-6bf11ad7ad4fe1a9.xml', '-k', 'not IMethodTest', '-x', '--reruns=2'] ... [2024-08-20 23:23:17.670772] 2024-08-20T23:23:20.3416078Z 2024-08-20T23:23:20.3417390Z cpp/test_api 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.test_api_1.1_3b6eda8b51a0f983_.log 2024-08-20T23:23:20.3418182Z 2024-08-20T23:23:20.3421653Z Running cpp/test_api 1/1 ... [2024-08-20 23:23:20.341853] 2024-08-20T23:23:20.3422172Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:23:20.3428791Z Executing ['pytest', '/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/test_api', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-a1fd38355b5e06bc.xml', '-k', 'not IMethodTest', '-x', '--reruns=2'] ... [2024-08-20 23:23:20.342434] 2024-08-20T23:27:23.7255991Z 2024-08-20T23:27:23.7257249Z cpp/test_api 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.test_api_1.1_4bc39d765caf7435_.log 2024-08-20T23:27:23.7270902Z 2024-08-20T23:27:24.2872402Z Running test batch 'tests to run' cost 246.62 seconds 2024-08-20T23:27:24.7874641Z + python test/run_test.py --cpp --verbose -i cpp/test_tensorexpr 2024-08-20T23:27:24.8848489Z /var/lib/jenkins/workspace/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-08-20T23:27:24.8849591Z import pkg_resources 2024-08-20T23:27:28.3696825Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/workspace/test/.pytorch-disabled-tests.json 2024-08-20T23:27:28.3800179Z Found test times from artifacts 2024-08-20T23:27:28.4227779Z Found test times from artifacts 2024-08-20T23:27:28.4243009Z Running 25% of tests based on TD 2024-08-20T23:27:28.4245895Z Running parallel tests on 3 processes 2024-08-20T23:27:28.4246363Z Name: tests to run (est. time: 0.0min) 2024-08-20T23:27:28.4246779Z Serial tests (0): 2024-08-20T23:27:28.4250500Z Parallel tests (1): 2024-08-20T23:27:28.4250930Z cpp/test_tensorexpr 1/1 2024-08-20T23:27:28.4251327Z Name: excluded (est. time: 0.0min) 2024-08-20T23:27:28.4251753Z Serial tests (0): 2024-08-20T23:27:28.4252084Z Parallel tests (0): 2024-08-20T23:27:28.4304706Z Running cpp/test_tensorexpr 1/1 ... [2024-08-20 23:27:28.430030] 2024-08-20T23:27:28.4305293Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:27:28.4311183Z Executing ['pytest', '/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/test_tensorexpr', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-64ad9f83db2cb0d7.xml', '-x', '--reruns=2'] ... [2024-08-20 23:27:28.430547] 2024-08-20T23:27:30.7514673Z 2024-08-20T23:27:30.7516692Z cpp/test_tensorexpr 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.test_tensorexpr_1.1_57382aa597ab4482_.log 2024-08-20T23:27:30.7517668Z 2024-08-20T23:27:30.9306788Z Running cpp/test_tensorexpr 1/1 ... [2024-08-20 23:27:30.929975] 2024-08-20T23:27:30.9307401Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-08-20T23:27:30.9311149Z Executing ['pytest', '/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin/test_tensorexpr', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '3', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-9083f680d0f26ed7.xml', '-x', '--reruns=2'] ... [2024-08-20 23:27:30.930495] 2024-08-20T23:30:34.7388683Z 2024-08-20T23:30:34.7390400Z cpp/test_tensorexpr 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.test_tensorexpr_1.1_4c13cfeca17e4fbc_.log 2024-08-20T23:30:34.7400368Z 2024-08-20T23:30:35.3022411Z Running test batch 'tests to run' cost 186.88 seconds 2024-08-20T23:30:35.8477311Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *android* ]] 2024-08-20T23:30:35.8478025Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *cuda* ]] 2024-08-20T23:30:35.8478528Z + [[ -z 1 ]] 2024-08-20T23:30:35.8478816Z + [[ 1 == \2 ]] 2024-08-20T23:30:35.8479123Z + assert_git_not_dirty 2024-08-20T23:30:35.8484362Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *rocm* ]] 2024-08-20T23:30:35.8485218Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 != *xla* ]] 2024-08-20T23:30:35.8485760Z ++ git status --porcelain 2024-08-20T23:30:35.8486173Z ++ grep -v '?? third_party' 2024-08-20T23:30:36.0872901Z ++ true 2024-08-20T23:30:36.0873303Z + git_status= 2024-08-20T23:30:36.0873908Z + [[ -n '' ]] 2024-08-20T23:30:36.0874536Z + [[ linux-focal-cuda12.4-py3.10-gcc9-sm86 == *xpu* ]] 2024-08-20T23:30:36.0875612Z + cleanup_workspace 2024-08-20T23:30:36.0876532Z + echo 'sudo may print the following warning message that can be ignored. The chown command will still run.' 2024-08-20T23:30:36.0877574Z sudo may print the following warning message that can be ignored. The chown command will still run. 2024-08-20T23:30:36.0878485Z + echo ' sudo: setrlimit(RLIMIT_STACK): Operation not permitted' 2024-08-20T23:30:36.0879123Z sudo: setrlimit(RLIMIT_STACK): Operation not permitted 2024-08-20T23:30:36.0880042Z + echo 'For more details refer to https://github.com/sudo-project/sudo/issues/42' 2024-08-20T23:30:36.0881151Z For more details refer to https://github.com/sudo-project/sudo/issues/42 2024-08-20T23:30:36.0881835Z + sudo chown -R 1000 /var/lib/jenkins/workspace 2024-08-20T23:30:36.8325174Z ##[group]Run cat test/**/*_toprint.log || true 2024-08-20T23:30:36.8325682Z cat test/**/*_toprint.log || true 2024-08-20T23:30:36.8339461Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T23:30:36.8339958Z env: 2024-08-20T23:30:36.8340234Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:36.8340683Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:36.8341409Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:36.8342044Z ##[endgroup] 2024-08-20T23:30:36.8437225Z cat: 'test/**/*_toprint.log': No such file or directory 2024-08-20T23:30:36.8470843Z ##[group]Run kill "$MONITOR_SCRIPT_PID" 2024-08-20T23:30:36.8471313Z kill "$MONITOR_SCRIPT_PID" 2024-08-20T23:30:36.8479573Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T23:30:36.8480083Z env: 2024-08-20T23:30:36.8480374Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:36.8480829Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:36.8481560Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:36.8482218Z MONITOR_SCRIPT_PID: 52420 2024-08-20T23:30:36.8482579Z ##[endgroup] 2024-08-20T23:30:36.8677254Z Prepare all required actions 2024-08-20T23:30:36.8677700Z Getting action download info 2024-08-20T23:30:37.1058345Z Download action repository 'actions/upload-artifact@v3' (SHA:a8a3f3ad30e3422c9c7b888a15615d19a852ae32) 2024-08-20T23:30:37.2805112Z ##[group]Run ./.github/actions/upload-test-artifacts 2024-08-20T23:30:37.2805581Z with: 2024-08-20T23:30:37.2806099Z file-suffix: test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828 2024-08-20T23:30:37.2806744Z s3-bucket: gha-artifacts 2024-08-20T23:30:37.2807077Z env: 2024-08-20T23:30:37.2807353Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:37.2807822Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:37.2808543Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:37.2809180Z ##[endgroup] 2024-08-20T23:30:37.2835308Z ##[group]Run # Remove any previous test jsons if they exist 2024-08-20T23:30:37.2835939Z # Remove any previous test jsons if they exist 2024-08-20T23:30:37.2836432Z rm -f test-jsons-*.zip 2024-08-20T23:30:37.2836946Z zip -r "test-jsons-${FILE_SUFFIX}.zip" test -i '*.json' 2024-08-20T23:30:37.2846078Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T23:30:37.2846559Z env: 2024-08-20T23:30:37.2846844Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:37.2847290Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:37.2848013Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:37.2848887Z FILE_SUFFIX: test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828 2024-08-20T23:30:37.2849521Z ##[endgroup] 2024-08-20T23:30:37.3240483Z adding: test/allowlist_for_publicAPI.json (deflated 79%) 2024-08-20T23:30:37.3270554Z adding: test/benchmark_utils/callgrind_artifacts.json (deflated 92%) 2024-08-20T23:30:37.3271233Z adding: test/minioptest_failures_dict.json (deflated 70%) 2024-08-20T23:30:37.3277521Z adding: test/profiler/profiler_utils_mock_events.json (deflated 87%) 2024-08-20T23:30:37.3283281Z adding: test/slow_tests.json (deflated 82%) 2024-08-20T23:30:37.3286876Z adding: test/test-reports/td_exclusions-9275f8d2f866fb69e4c7.json (deflated 82%) 2024-08-20T23:30:37.3288033Z adding: test/test-reports/td_exclusions-460607718ae0b7ca755b.json (deflated 73%) 2024-08-20T23:30:37.3289146Z adding: test/test-reports/td_exclusions-6a32a7143d38ef2bbb56.json (deflated 14%) 2024-08-20T23:30:37.3290117Z adding: test/test-reports/td_exclusions-24c1051a2b589d93cbbe.json (deflated 15%) 2024-08-20T23:30:37.3291301Z adding: test/test-reports/td_exclusions-3ba93e6000d3217e79c3.json (deflated 14%) 2024-08-20T23:30:37.3292224Z adding: test/test-reports/td_exclusions-50ed1de59bf8f07e052b.json (deflated 13%) 2024-08-20T23:30:37.3293146Z adding: test/test-reports/td_exclusions-11d4f8a6f59e94d50e5a.json (deflated 14%) 2024-08-20T23:30:37.3294082Z adding: test/test-reports/td_exclusions-2726d0707e2805fb3626.json (deflated 14%) 2024-08-20T23:30:37.3294998Z adding: test/test-reports/td_exclusions-30cacb0a07ed469a1996.json (deflated 13%) 2024-08-20T23:30:37.3295926Z adding: test/test-reports/td_exclusions-c1f2d371a06abc049642.json (deflated 15%) 2024-08-20T23:30:37.3296842Z adding: test/test-reports/td_exclusions-ba4102794d0e10ba7866.json (deflated 14%) 2024-08-20T23:30:37.3297747Z adding: test/test-reports/td_exclusions-68caf17e9579f4863b39.json (deflated 18%) 2024-08-20T23:30:37.3298670Z adding: test/test-reports/td_exclusions-75b7ea67868268bb94a9.json (deflated 16%) 2024-08-20T23:30:37.3301737Z adding: test/.pytorch-disabled-tests.json (deflated 88%) 2024-08-20T23:30:37.3334568Z ##[group]Run # Remove any previous test reports if they exist 2024-08-20T23:30:37.3335206Z # Remove any previous test reports if they exist 2024-08-20T23:30:37.3335739Z rm -f test-reports-*.zip 2024-08-20T23:30:37.3336335Z zip -r "test-reports-${FILE_SUFFIX}.zip" test -i '*.xml' -i '*.csv' 2024-08-20T23:30:37.3345637Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T23:30:37.3346142Z env: 2024-08-20T23:30:37.3346435Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:37.3346887Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:37.3347621Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:37.3348493Z FILE_SUFFIX: test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828 2024-08-20T23:30:37.3349118Z ##[endgroup] 2024-08-20T23:30:37.3628347Z adding: test/test-reports/python-pytest/inductor.test_max_autotune/inductor.test_max_autotune-ef4182cc4a18381c.xml (deflated 91%) 2024-08-20T23:30:37.3629923Z adding: test/test-reports/python-pytest/inductor.test_distributed_patterns/inductor.test_distributed_patterns-00751d113ff77be5.xml (deflated 86%) 2024-08-20T23:30:37.3723609Z adding: test/test-reports/python-pytest/test_utils/test_utils-8214423e9c260cc6.xml (deflated 98%) 2024-08-20T23:30:37.3787332Z adding: test/test-reports/python-pytest/test_nn/test_nn-74d8c8af77c69953.xml (deflated 97%) 2024-08-20T23:30:37.3788671Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-a48120e5e02c4a61.xml (deflated 28%) 2024-08-20T23:30:37.3793309Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_opinfo/inductor.test_torchinductor_opinfo-dd88ac8dcf1320d8.xml (deflated 93%) 2024-08-20T23:30:37.3795281Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-beb694170d6929cb.xml (deflated 29%) 2024-08-20T23:30:37.3797325Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-152fb35d755fb589.xml (deflated 28%) 2024-08-20T23:30:37.3799454Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-e54f4e7b49548909.xml (deflated 28%) 2024-08-20T23:30:37.3802526Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-1ffaee2ef4079b9e.xml (deflated 92%) 2024-08-20T23:30:37.3809745Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-d93d40ff7223d4b9.xml (deflated 92%) 2024-08-20T23:30:37.3819031Z adding: test/test-reports/python-pytest/inductor.test_torchinductor_dynamic_shapes/inductor.test_torchinductor_dynamic_shapes-287b4fb0ac3e9831.xml (deflated 92%) 2024-08-20T23:30:37.3820827Z adding: test/test-reports/python-pytest/inductor.test_mmdecomp/inductor.test_mmdecomp-93465b9081356c9f.xml (deflated 28%) 2024-08-20T23:30:37.3822171Z adding: test/test-reports/python-pytest/inductor.test_mmdecomp/inductor.test_mmdecomp-2621a8c81bc4e853.xml (deflated 84%) 2024-08-20T23:30:37.3823485Z adding: test/test-reports/python-pytest/dynamo.test_interop/dynamo.test_interop-74f86b57e462b9bc.xml (deflated 28%) 2024-08-20T23:30:37.3824799Z adding: test/test-reports/python-pytest/dynamo.test_interop/dynamo.test_interop-87fbdb8ff497e221.xml (deflated 71%) 2024-08-20T23:30:37.3826087Z adding: test/test-reports/python-pytest/dynamo.test_logging/dynamo.test_logging-be84a9af3a7b796e.xml (deflated 28%) 2024-08-20T23:30:37.3866144Z adding: test/test-reports/python-pytest/dynamo.test_logging/dynamo.test_logging-88a4afda697f4fd6.xml (deflated 94%) 2024-08-20T23:30:37.3867402Z adding: test/test-reports/python-pytest/dynamo.test_exc/dynamo.test_exc-1a4935f28f8b3fe8.xml (deflated 28%) 2024-08-20T23:30:37.3880397Z adding: test/test-reports/python-pytest/dynamo.test_exc/dynamo.test_exc-a00b78fb987151a3.xml (deflated 95%) 2024-08-20T23:30:37.3881656Z adding: test/test-reports/python-pytest/dynamo.test_global/dynamo.test_global-3da6a40a45734975.xml (deflated 28%) 2024-08-20T23:30:37.3882918Z adding: test/test-reports/python-pytest/dynamo.test_global/dynamo.test_global-84ffa62ed0e43f23.xml (deflated 86%) 2024-08-20T23:30:37.3884413Z adding: test/test-reports/python-pytest/dynamo.test_unspec/dynamo.test_unspec-ba9712f414b4a479.xml (deflated 28%) 2024-08-20T23:30:37.3885693Z adding: test/test-reports/python-pytest/dynamo.test_unspec/dynamo.test_unspec-5d947974493cd6d4.xml (deflated 84%) 2024-08-20T23:30:37.3887086Z adding: test/test-reports/python-pytest/inductor.test_cudagraph_trees/inductor.test_cudagraph_trees-2745dc355acbfa06.xml (deflated 28%) 2024-08-20T23:30:37.3890278Z adding: test/test-reports/python-pytest/inductor.test_cudagraph_trees/inductor.test_cudagraph_trees-5272daa47bc5cd9b.xml (deflated 91%) 2024-08-20T23:30:37.3891722Z adding: test/test-reports/python-pytest/dynamo.test_ctx_manager/dynamo.test_ctx_manager-711aa706a9237eed.xml (deflated 28%) 2024-08-20T23:30:37.3893933Z adding: test/test-reports/python-pytest/dynamo.test_ctx_manager/dynamo.test_ctx_manager-e02f6df477db3fc1.xml (deflated 88%) 2024-08-20T23:30:37.3895372Z adding: test/test-reports/python-pytest/dynamo.test_subgraphs/dynamo.test_subgraphs-d9402d7f0fdacb77.xml (deflated 29%) 2024-08-20T23:30:37.3897158Z adding: test/test-reports/python-pytest/dynamo.test_subgraphs/dynamo.test_subgraphs-48fb809f514f6522.xml (deflated 96%) 2024-08-20T23:30:37.3898604Z adding: test/test-reports/python-pytest/inductor.test_pattern_matcher/inductor.test_pattern_matcher-887795df27e58e60.xml (deflated 29%) 2024-08-20T23:30:37.3900229Z adding: test/test-reports/python-pytest/inductor.test_pattern_matcher/inductor.test_pattern_matcher-dd009055282b9c9b.xml (deflated 90%) 2024-08-20T23:30:37.3901794Z adding: test/test-reports/python-pytest/dynamo.test_autograd_function/dynamo.test_autograd_function-96858319287762df.xml (deflated 28%) 2024-08-20T23:30:37.3903296Z adding: test/test-reports/python-pytest/dynamo.test_autograd_function/dynamo.test_autograd_function-1829fda48228b124.xml (deflated 86%) 2024-08-20T23:30:37.3904976Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-f9405c3aa06569ae.xml (deflated 29%) 2024-08-20T23:30:37.3906765Z adding: test/test-reports/python-pytest/dynamo.test_activation_checkpointing/dynamo.test_activation_checkpointing-14eb1d1fc25f9f0d.xml (deflated 89%) 2024-08-20T23:30:37.3908350Z adding: test/test-reports/python-pytest/inductor.test_inductor_freezing/inductor.test_inductor_freezing-a34917fdd59de0d4.xml (deflated 29%) 2024-08-20T23:30:37.3909851Z adding: test/test-reports/python-pytest/inductor.test_inductor_freezing/inductor.test_inductor_freezing-89506e4d697c8ba7.xml (deflated 91%) 2024-08-20T23:30:37.3911455Z adding: test/test-reports/python-pytest/inductor.test_mkldnn_pattern_matcher/inductor.test_mkldnn_pattern_matcher-06013de3bbf5f57c.xml (deflated 28%) 2024-08-20T23:30:37.3913465Z adding: test/test-reports/python-pytest/inductor.test_mkldnn_pattern_matcher/inductor.test_mkldnn_pattern_matcher-b6db48dcae478ac6.xml (deflated 93%) 2024-08-20T23:30:37.3915005Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-bfccf9f0bd5fc5b4.xml (deflated 28%) 2024-08-20T23:30:37.3916496Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-cf87db9c7cefed6b.xml (deflated 28%) 2024-08-20T23:30:37.3917977Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-86d1ea3d857afe34.xml (deflated 28%) 2024-08-20T23:30:37.3919507Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-23f02f0912cc5941.xml (deflated 88%) 2024-08-20T23:30:37.3920998Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-6d0406a967180785.xml (deflated 89%) 2024-08-20T23:30:37.3922444Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-a698ea308c44f213.xml (deflated 89%) 2024-08-20T23:30:37.3924114Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-e2301ff56e81127e.xml (deflated 88%) 2024-08-20T23:30:37.3925776Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-2235af948063f604.xml (deflated 36%) 2024-08-20T23:30:37.3927434Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-5c9d0442aa718d17.xml (deflated 28%) 2024-08-20T23:30:37.3929284Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-4e28cf9369da40f3.xml (deflated 89%) 2024-08-20T23:30:37.3931112Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-f7cfbcc6dc95e22a.xml (deflated 37%) 2024-08-20T23:30:37.3932883Z adding: test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-92326a8789d4e577.xml (deflated 28%) 2024-08-20T23:30:37.3934539Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-b389bdf401504fe7.xml (deflated 29%) 2024-08-20T23:30:37.3936075Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-57adeefc1c3ba8ac.xml (deflated 28%) 2024-08-20T23:30:37.3937585Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-d6912d9403b7428c.xml (deflated 29%) 2024-08-20T23:30:37.3939088Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-09be58b340048525.xml (deflated 29%) 2024-08-20T23:30:37.3940590Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-0b0a9ddb90faf8ba.xml (deflated 29%) 2024-08-20T23:30:37.3942035Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-11d64b0790f641b8.xml (deflated 58%) 2024-08-20T23:30:37.3943179Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-c56e0c632a466fc5.xml (deflated 73%) 2024-08-20T23:30:37.3944304Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-19e8311e51be88b7.xml (deflated 67%) 2024-08-20T23:30:37.3945439Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-952351d32d807991.xml (deflated 84%) 2024-08-20T23:30:37.3946583Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-536077b20436c395.xml (deflated 79%) 2024-08-20T23:30:37.3947716Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-63261557a3d90cef.xml (deflated 29%) 2024-08-20T23:30:37.3948846Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-7e15793039dac064.xml (deflated 76%) 2024-08-20T23:30:37.3949987Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-387afc7e488d8aec.xml (deflated 29%) 2024-08-20T23:30:37.3951133Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-361a12dddefba5ae.xml (deflated 35%) 2024-08-20T23:30:37.3952432Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-af8ac05b6fb9dc10.xml (deflated 29%) 2024-08-20T23:30:37.3953601Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-b68d52babaad5f1a.xml (deflated 48%) 2024-08-20T23:30:37.3954743Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-0d094c0bc93ccf13.xml (deflated 29%) 2024-08-20T23:30:37.3955955Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-1fc376019fff7c7b.xml (deflated 61%) 2024-08-20T23:30:37.3957332Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-e3248f2dc2d47e8d.xml (deflated 29%) 2024-08-20T23:30:37.3958484Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-0fdf6b3387841554.xml (deflated 36%) 2024-08-20T23:30:37.3959692Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-e8eac0802368c4b6.xml (deflated 29%) 2024-08-20T23:30:37.3960827Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-38a71fd1a46dc97b.xml (deflated 72%) 2024-08-20T23:30:37.3961978Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-5a8b5dddb6a622e8.xml (deflated 29%) 2024-08-20T23:30:37.3963128Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-19257ebb22991d5d.xml (deflated 84%) 2024-08-20T23:30:37.3964374Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-c0808012ec4fa4f6.xml (deflated 29%) 2024-08-20T23:30:37.3965514Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-39adbb35aef1c6b5.xml (deflated 52%) 2024-08-20T23:30:37.3966648Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-7506717cc1e6fe70.xml (deflated 28%) 2024-08-20T23:30:37.3968055Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-93b5d3333d4af613.xml (deflated 62%) 2024-08-20T23:30:37.3969528Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-6bf11ad7ad4fe1a9.xml (deflated 29%) 2024-08-20T23:30:37.3970673Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-a1fd38355b5e06bc.xml (deflated 88%) 2024-08-20T23:30:37.3971811Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-64ad9f83db2cb0d7.xml (deflated 30%) 2024-08-20T23:30:37.3972951Z adding: test/test-reports/python-pytest/test.run_test/test.run_test-9083f680d0f26ed7.xml (deflated 89%) 2024-08-20T23:30:37.4002185Z ##[group]Run # Remove any previous usage logs if they exist 2024-08-20T23:30:37.4002818Z # Remove any previous usage logs if they exist 2024-08-20T23:30:37.4003312Z rm -f logs-*.zip 2024-08-20T23:30:37.4003979Z # this workflow is also run in bazel build test, but we dont generate usage reports for it 2024-08-20T23:30:37.4004756Z # so check to see if the file exists first 2024-08-20T23:30:37.4005252Z if [ -f 'usage_log.txt' ]; then 2024-08-20T23:30:37.4005794Z  zip "logs-${FILE_SUFFIX}.zip" 'usage_log.txt' 2024-08-20T23:30:37.4006302Z fi 2024-08-20T23:30:37.4006664Z if ls test/**/*.log 1> /dev/null 2>&1; then 2024-08-20T23:30:37.4007254Z  zip -r "logs-${FILE_SUFFIX}.zip" test -i '*.log' 2024-08-20T23:30:37.4007750Z fi 2024-08-20T23:30:37.4016493Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T23:30:37.4016999Z env: 2024-08-20T23:30:37.4017290Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:37.4017743Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:37.4018481Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:37.4019365Z FILE_SUFFIX: test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828 2024-08-20T23:30:37.4020004Z ##[endgroup] 2024-08-20T23:30:37.4108553Z adding: usage_log.txt (deflated 92%) 2024-08-20T23:30:37.4417008Z adding: test/test-reports/inductor.test_max_autotune_1.1_90cbe4a42765cb97_.log (deflated 85%) 2024-08-20T23:30:37.4418125Z adding: test/test-reports/inductor.test_distributed_patterns_1.1_bf928dc8d1ea8e85_.log (deflated 82%) 2024-08-20T23:30:37.4531976Z adding: test/test-reports/test_utils_1.1_49f3548ae768572f_.log (deflated 96%) 2024-08-20T23:30:37.4595304Z adding: test/test-reports/test_nn_1.1_ebef799667f24a9c_.log (deflated 96%) 2024-08-20T23:30:37.4596356Z adding: test/test-reports/inductor.test_torchinductor_opinfo_3.13_754488b9a0974eb2_.log (deflated 52%) 2024-08-20T23:30:37.4597523Z adding: test/test-reports/inductor.test_torchinductor_dynamic_shapes_4.6_e9187bea32da0bdb_.log (deflated 52%) 2024-08-20T23:30:37.4598713Z adding: test/test-reports/inductor.test_torchinductor_dynamic_shapes_5.6_b0ce7145d0005719_.log (deflated 52%) 2024-08-20T23:30:37.4599969Z adding: test/test-reports/inductor.test_torchinductor_dynamic_shapes_6.6_44afbbef75406b9b_.log (deflated 52%) 2024-08-20T23:30:37.4601057Z adding: test/test-reports/inductor.test_mmdecomp_1.1_a4a2a61d68ecb080_.log (deflated 50%) 2024-08-20T23:30:37.4602038Z adding: test/test-reports/dynamo.test_interop_1.1_cdd06ccb6ec3fce6_.log (deflated 50%) 2024-08-20T23:30:37.4603009Z adding: test/test-reports/dynamo.test_logging_1.1_ec02132695f78500_.log (deflated 50%) 2024-08-20T23:30:37.4603938Z adding: test/test-reports/dynamo.test_exc_1.1_645c1493b3b39705_.log (deflated 49%) 2024-08-20T23:30:37.4605074Z adding: test/test-reports/dynamo.test_global_1.1_e9904aa1427ada16_.log (deflated 50%) 2024-08-20T23:30:37.4606026Z adding: test/test-reports/dynamo.test_unspec_1.1_5da9c400816668ab_.log (deflated 50%) 2024-08-20T23:30:37.4607045Z adding: test/test-reports/inductor.test_cudagraph_trees_1.1_4ed39b64fe3fa186_.log (deflated 51%) 2024-08-20T23:30:37.4608080Z adding: test/test-reports/dynamo.test_ctx_manager_1.1_39bb636c6cc72699_.log (deflated 50%) 2024-08-20T23:30:37.4609056Z adding: test/test-reports/dynamo.test_subgraphs_1.1_6d75ef8d2c982096_.log (deflated 50%) 2024-08-20T23:30:37.4610089Z adding: test/test-reports/inductor.test_pattern_matcher_1.1_42859534082d29c5_.log (deflated 51%) 2024-08-20T23:30:37.4611147Z adding: test/test-reports/dynamo.test_autograd_function_1.1_83c351875f80d4c7_.log (deflated 51%) 2024-08-20T23:30:37.4612246Z adding: test/test-reports/dynamo.test_activation_checkpointing_1.1_f01f200d89d7ffe6_.log (deflated 52%) 2024-08-20T23:30:37.4613350Z adding: test/test-reports/inductor.test_inductor_freezing_1.1_c7bfbc6ba6b8cf25_.log (deflated 52%) 2024-08-20T23:30:37.4614473Z adding: test/test-reports/inductor.test_mkldnn_pattern_matcher_1.1_f0af5f6af3934195_.log (deflated 52%) 2024-08-20T23:30:37.4615554Z adding: test/test-reports/inductor.test_aot_inductor_6.16_29ce288076f824ea_.log (deflated 51%) 2024-08-20T23:30:37.4616589Z adding: test/test-reports/inductor.test_aot_inductor_7.16_cea4703a7db51c70_.log (deflated 51%) 2024-08-20T23:30:37.4617639Z adding: test/test-reports/inductor.test_aot_inductor_14.16_e938f28b87cc8c41_.log (deflated 51%) 2024-08-20T23:30:37.4618681Z adding: test/test-reports/inductor.test_cpu_cpp_wrapper_1.1_790233f39735ecf4_.log (stored 0%) 2024-08-20T23:30:37.4619808Z adding: test/test-reports/inductor.test_torchinductor_dynamic_shapes_4.6_751d6449bb22a520_.log (deflated 91%) 2024-08-20T23:30:37.4623241Z adding: test/test-reports/inductor.test_torchinductor_dynamic_shapes_5.6_d0a003dead724c4a_.log (deflated 90%) 2024-08-20T23:30:37.4624347Z adding: test/test-reports/inductor.test_mmdecomp_1.1_8091e9ab0f841e91_.log (deflated 83%) 2024-08-20T23:30:37.4625333Z adding: test/test-reports/dynamo.test_interop_1.1_bfdf717c06dc6d1c_.log (deflated 60%) 2024-08-20T23:30:37.4626429Z adding: test/test-reports/dynamo.test_logging_1.1_d24298d44e58599e_.log (deflated 87%) 2024-08-20T23:30:37.4627361Z adding: test/test-reports/dynamo.test_exc_1.1_dd2fb399061bcfaa_.log (deflated 70%) 2024-08-20T23:30:37.4628306Z adding: test/test-reports/dynamo.test_global_1.1_9d25f6270b81f203_.log (deflated 74%) 2024-08-20T23:30:37.4629648Z adding: test/test-reports/dynamo.test_unspec_1.1_93c118265b6937fb_.log (deflated 80%) 2024-08-20T23:30:37.4632879Z adding: test/test-reports/inductor.test_cudagraph_trees_1.1_f60c058c5a614110_.log (deflated 87%) 2024-08-20T23:30:37.4641645Z adding: test/test-reports/inductor.test_torchinductor_opinfo_3.13_8770fb90d6be8e5a_.log (deflated 91%) 2024-08-20T23:30:37.4643100Z adding: test/test-reports/dynamo.test_ctx_manager_1.1_75855fe6ab32fe1c_.log (deflated 84%) 2024-08-20T23:30:37.4644637Z adding: test/test-reports/dynamo.test_subgraphs_1.1_a61c88364af4b0f0_.log (deflated 83%) 2024-08-20T23:30:37.4645919Z adding: test/test-reports/dynamo.test_autograd_function_1.1_ad8e2fcd08863b7b_.log (deflated 83%) 2024-08-20T23:30:37.4647398Z adding: test/test-reports/dynamo.test_activation_checkpointing_1.1_7e7764c45dd50549_.log (deflated 84%) 2024-08-20T23:30:37.4648965Z adding: test/test-reports/inductor.test_inductor_freezing_1.1_8de405f949c5ce50_.log (deflated 86%) 2024-08-20T23:30:37.4651407Z adding: test/test-reports/inductor.test_pattern_matcher_1.1_df0e9bce35c5ebaf_.log (deflated 83%) 2024-08-20T23:30:37.4659594Z adding: test/test-reports/inductor.test_torchinductor_dynamic_shapes_6.6_0a0b3453adb372fa_.log (deflated 91%) 2024-08-20T23:30:37.4664772Z adding: test/test-reports/inductor.test_mkldnn_pattern_matcher_1.1_2ceeacc2a014a25b_.log (deflated 93%) 2024-08-20T23:30:37.4668071Z adding: test/test-reports/inductor.test_aot_inductor_6.16_95daac6689370c5a_.log (deflated 88%) 2024-08-20T23:30:37.4669309Z adding: test/test-reports/inductor.test_cpu_cpp_wrapper_1.1_7eb9a7d851dfc193_.log (stored 0%) 2024-08-20T23:30:37.4681643Z adding: test/test-reports/inductor.test_aot_inductor_7.16_5ea37599c1ce7c3e_.log (deflated 92%) 2024-08-20T23:30:37.4700733Z adding: test/test-reports/inductor.test_aot_inductor_14.16_12ae829b1c880606_.log (deflated 94%) 2024-08-20T23:30:37.4701720Z adding: test/test-reports/cpp.Dict_test_1.1_aaaa37355bf8a051_.log (deflated 49%) 2024-08-20T23:30:37.4702631Z adding: test/test-reports/cpp.Dimname_test_1.1_a5340cd377bbd8ee_.log (deflated 49%) 2024-08-20T23:30:37.4703591Z adding: test/test-reports/cpp.NamedTensor_test_1.1_684d5d0abb1da5d7_.log (deflated 49%) 2024-08-20T23:30:37.4704559Z adding: test/test-reports/cpp.apply_utils_test_1.1_faaba58fe49bcd2f_.log (deflated 49%) 2024-08-20T23:30:37.4705477Z adding: test/test-reports/cpp.atest_1.1_be6f3b64ed920491_.log (deflated 49%) 2024-08-20T23:30:37.4706376Z adding: test/test-reports/cpp.Dimname_test_1.1_e1dcbccbbf55d0ff_.log (deflated 60%) 2024-08-20T23:30:37.4707316Z adding: test/test-reports/cpp.NamedTensor_test_1.1_0609195c843e387d_.log (deflated 72%) 2024-08-20T23:30:37.4708280Z adding: test/test-reports/cpp.apply_utils_test_1.1_6b8ea44231ad3dd4_.log (deflated 66%) 2024-08-20T23:30:37.4709207Z adding: test/test-reports/cpp.Dict_test_1.1_7533f43b4a5eca8b_.log (deflated 85%) 2024-08-20T23:30:37.4710082Z adding: test/test-reports/cpp.atest_1.1_ddda09b994deaa3e_.log (deflated 74%) 2024-08-20T23:30:37.4711002Z adding: test/test-reports/cpp.cuda_generator_test_1.1_6c55348e9bd6feb5_.log (deflated 49%) 2024-08-20T23:30:37.4712048Z adding: test/test-reports/cpp.cuda_generator_test_1.1_d9de8d24cd176ac3_.log (deflated 75%) 2024-08-20T23:30:37.4713035Z adding: test/test-reports/cpp.cuda_half_test_1.1_502ab3190b60f479_.log (deflated 49%) 2024-08-20T23:30:37.4713965Z adding: test/test-reports/cpp.cuda_half_test_1.1_c7f7da4995c3237a_.log (deflated 49%) 2024-08-20T23:30:37.4714952Z adding: test/test-reports/cpp.cuda_vectorized_test_1.1_262b4d2d0851d725_.log (deflated 49%) 2024-08-20T23:30:37.4715948Z adding: test/test-reports/cpp.cuda_vectorized_test_1.1_b03a1b339e1c3906_.log (deflated 56%) 2024-08-20T23:30:37.4716963Z adding: test/test-reports/cpp.cuda_distributions_test_1.1_b27957ce457b6ca5_.log (deflated 49%) 2024-08-20T23:30:37.4717990Z adding: test/test-reports/cpp.cuda_distributions_test_1.1_3e05a59c6dcf4e62_.log (deflated 64%) 2024-08-20T23:30:37.4719013Z adding: test/test-reports/cpp.cuda_optional_test_1.1_0f7550f7d889dd5e_.log (deflated 49%) 2024-08-20T23:30:37.4720052Z adding: test/test-reports/cpp.cuda_optional_test_1.1_c67b9a9787d814ef_.log (deflated 50%) 2024-08-20T23:30:37.4721205Z adding: test/test-reports/cpp.cuda_complex_test_1.1_fcbedcb022e59884_.log (deflated 49%) 2024-08-20T23:30:37.4722185Z adding: test/test-reports/cpp.cuda_complex_test_1.1_4011fd5908d1ae23_.log (deflated 71%) 2024-08-20T23:30:37.4723197Z adding: test/test-reports/cpp.cuda_complex_math_test_1.1_7c4c65ed65b01632_.log (deflated 49%) 2024-08-20T23:30:37.4724220Z adding: test/test-reports/cpp.cuda_complex_math_test_1.1_68da292c4f34bcc6_.log (deflated 82%) 2024-08-20T23:30:37.4725190Z adding: test/test-reports/cpp.cuda_cub_test_1.1_b2bfe7cc369b9fc0_.log (deflated 49%) 2024-08-20T23:30:37.4726132Z adding: test/test-reports/cpp.cuda_cub_test_1.1_653ea1cf7654f4e9_.log (deflated 57%) 2024-08-20T23:30:37.4727098Z adding: test/test-reports/cpp.cuda_atomic_ops_test_1.1_30873c2559863a6c_.log (deflated 49%) 2024-08-20T23:30:37.4728090Z adding: test/test-reports/cpp.cuda_atomic_ops_test_1.1_d5a336d04929c864_.log (deflated 63%) 2024-08-20T23:30:37.4729047Z adding: test/test-reports/cpp.test_api_1.1_3b6eda8b51a0f983_.log (deflated 48%) 2024-08-20T23:30:37.4752603Z adding: test/test-reports/cpp.test_api_1.1_4bc39d765caf7435_.log (deflated 93%) 2024-08-20T23:30:37.4753660Z adding: test/test-reports/cpp.test_tensorexpr_1.1_57382aa597ab4482_.log (deflated 48%) 2024-08-20T23:30:37.4779633Z adding: test/test-reports/cpp.test_tensorexpr_1.1_4c13cfeca17e4fbc_.log (deflated 94%) 2024-08-20T23:30:37.4812940Z ##[group]Run # Remove any previous debugging artifacts if they exist 2024-08-20T23:30:37.4813666Z # Remove any previous debugging artifacts if they exist 2024-08-20T23:30:37.4814208Z rm -f debug-*.zip 2024-08-20T23:30:37.4814595Z if [ -d 'test/debug' ]; then 2024-08-20T23:30:37.4815111Z  zip -r "debug-${FILE_SUFFIX}.zip" test/debug 2024-08-20T23:30:37.4815577Z fi 2024-08-20T23:30:37.4824290Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T23:30:37.4824810Z env: 2024-08-20T23:30:37.4825089Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:37.4825544Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:37.4826285Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:37.4827161Z FILE_SUFFIX: test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828 2024-08-20T23:30:37.4827803Z ##[endgroup] 2024-08-20T23:30:37.4924304Z ##[group]Run seemethere/upload-artifact-s3@v5 2024-08-20T23:30:37.4924752Z with: 2024-08-20T23:30:37.4925036Z s3-bucket: gha-artifacts 2024-08-20T23:30:37.4925468Z s3-prefix: pytorch/pytorch/10479310961/1/artifact 2024-08-20T23:30:37.4925938Z retention-days: 14 2024-08-20T23:30:37.4926268Z if-no-files-found: warn 2024-08-20T23:30:37.4926628Z path: test-jsons-*.zip 2024-08-20T23:30:37.4926970Z name: artifact 2024-08-20T23:30:37.4927269Z region: us-east-1 2024-08-20T23:30:37.4927572Z env: 2024-08-20T23:30:37.4927855Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:37.4928313Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:37.4929042Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:37.4929684Z ##[endgroup] 2024-08-20T23:30:37.8469182Z NOTE: s3-prefix specified, ignoring name parameter 2024-08-20T23:30:37.8469796Z With the provided path, there will be 1 file uploaded 2024-08-20T23:30:37.8470442Z Uploading to s3 prefix: pytorch/pytorch/10479310961/1/artifact 2024-08-20T23:30:37.9435282Z Starting upload of test-jsons-test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828.zip 2024-08-20T23:30:38.1045954Z Finished upload of test-jsons-test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828.zip 2024-08-20T23:30:38.1331943Z ##[group]Run seemethere/upload-artifact-s3@v5 2024-08-20T23:30:38.1332387Z with: 2024-08-20T23:30:38.1332670Z s3-bucket: gha-artifacts 2024-08-20T23:30:38.1333109Z s3-prefix: pytorch/pytorch/10479310961/1/artifact 2024-08-20T23:30:38.1333767Z retention-days: 14 2024-08-20T23:30:38.1334102Z if-no-files-found: error 2024-08-20T23:30:38.1334473Z path: test-reports-*.zip 2024-08-20T23:30:38.1334827Z name: artifact 2024-08-20T23:30:38.1335131Z region: us-east-1 2024-08-20T23:30:38.1335438Z env: 2024-08-20T23:30:38.1335724Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:38.1336174Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:38.1336928Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:38.1337570Z ##[endgroup] 2024-08-20T23:30:38.4870700Z NOTE: s3-prefix specified, ignoring name parameter 2024-08-20T23:30:38.4871857Z With the provided path, there will be 1 file uploaded 2024-08-20T23:30:38.4872478Z Uploading to s3 prefix: pytorch/pytorch/10479310961/1/artifact 2024-08-20T23:30:38.4927821Z Starting upload of test-reports-test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828.zip 2024-08-20T23:30:38.6703427Z Finished upload of test-reports-test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828.zip 2024-08-20T23:30:38.6987905Z ##[group]Run seemethere/upload-artifact-s3@v5 2024-08-20T23:30:38.6988336Z with: 2024-08-20T23:30:38.6988626Z s3-bucket: gha-artifacts 2024-08-20T23:30:38.6989054Z s3-prefix: pytorch/pytorch/10479310961/1/artifact 2024-08-20T23:30:38.6989517Z retention-days: 14 2024-08-20T23:30:38.6989852Z if-no-files-found: ignore 2024-08-20T23:30:38.6990409Z path: logs-*.zip 2024-08-20T23:30:38.6990717Z name: artifact 2024-08-20T23:30:38.6991035Z region: us-east-1 2024-08-20T23:30:38.6991339Z env: 2024-08-20T23:30:38.6991607Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:38.6992053Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:38.6992777Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:38.6993416Z ##[endgroup] 2024-08-20T23:30:39.0480782Z NOTE: s3-prefix specified, ignoring name parameter 2024-08-20T23:30:39.0481372Z With the provided path, there will be 1 file uploaded 2024-08-20T23:30:39.0482033Z Uploading to s3 prefix: pytorch/pytorch/10479310961/1/artifact 2024-08-20T23:30:39.0537134Z Starting upload of logs-test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828.zip 2024-08-20T23:30:39.2038663Z Finished upload of logs-test-default-1-5-amz2023.linux.g5.4xlarge.nvidia.gpu_29026448828.zip 2024-08-20T23:30:39.2323320Z ##[group]Run seemethere/upload-artifact-s3@v5 2024-08-20T23:30:39.2323764Z with: 2024-08-20T23:30:39.2324059Z s3-bucket: gha-artifacts 2024-08-20T23:30:39.2324494Z s3-prefix: pytorch/pytorch/10479310961/1/artifact 2024-08-20T23:30:39.2324949Z retention-days: 14 2024-08-20T23:30:39.2325286Z if-no-files-found: ignore 2024-08-20T23:30:39.2325641Z path: debug-*.zip 2024-08-20T23:30:39.2325947Z name: artifact 2024-08-20T23:30:39.2326248Z region: us-east-1 2024-08-20T23:30:39.2326550Z env: 2024-08-20T23:30:39.2326828Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:39.2327287Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:39.2328029Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:39.2328661Z ##[endgroup] 2024-08-20T23:30:39.5758141Z No files were found with the provided path: debug-*.zip. No artifacts will be uploaded. 2024-08-20T23:30:39.6040218Z ##[group]Run # shellcheck disable=SC2156 2024-08-20T23:30:39.6040711Z # shellcheck disable=SC2156 2024-08-20T23:30:39.6041492Z find . -iname "core.[1-9]*" -exec docker exec "${DOCKER_CONTAINER_ID}" sh -c "gdb python {} -ex 'bt' -ex 'q'" \; 2024-08-20T23:30:39.6050631Z shell: /usr/bin/bash -e {0} 2024-08-20T23:30:39.6050986Z env: 2024-08-20T23:30:39.6051279Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:39.6051725Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:39.6052456Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:39.6053173Z ##[endgroup] 2024-08-20T23:30:39.8637026Z ##[group]Run pytorch/test-infra/.github/actions/teardown-linux@main 2024-08-20T23:30:39.8637760Z with: 2024-08-20T23:30:39.8638028Z env: 2024-08-20T23:30:39.8638320Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:39.8638773Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:39.8639608Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:39.8640251Z ##[endgroup] 2024-08-20T23:30:39.8659907Z ##[group]Run set -eou pipefail 2024-08-20T23:30:39.8660319Z set -eou pipefail 2024-08-20T23:30:39.8660683Z  2024-08-20T23:30:39.8661196Z echo "Holding runner for 2 hours until all ssh sessions have logged out" 2024-08-20T23:30:39.8661834Z for _ in $(seq 1440); do 2024-08-20T23:30:39.8662317Z  # Break if no ssh session exists anymore 2024-08-20T23:30:39.8662822Z  if [ "$(who)" = "" ]; then 2024-08-20T23:30:39.8663232Z  break 2024-08-20T23:30:39.8663591Z  fi 2024-08-20T23:30:39.8663898Z  echo "." 2024-08-20T23:30:39.8664228Z  sleep 5 2024-08-20T23:30:39.8664549Z done 2024-08-20T23:30:39.8674020Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T23:30:39.8674535Z env: 2024-08-20T23:30:39.8674827Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:39.8675280Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:39.8676020Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:39.8676668Z ##[endgroup] 2024-08-20T23:30:39.8706395Z Holding runner for 2 hours until all ssh sessions have logged out 2024-08-20T23:30:39.8793607Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2024-08-20T23:30:39.8794363Z # ignore expansion of "docker ps -q" since it could be empty 2024-08-20T23:30:39.8794946Z # shellcheck disable=SC2046 2024-08-20T23:30:39.8795417Z docker stop $(docker ps -q) || true 2024-08-20T23:30:39.8795898Z # Prune all of the docker images 2024-08-20T23:30:39.8796347Z docker system prune -af 2024-08-20T23:30:39.8805273Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T23:30:39.8805760Z env: 2024-08-20T23:30:39.8806044Z GIT_DEFAULT_BRANCH: main 2024-08-20T23:30:39.8806500Z GPU_FLAG: --gpus all -e NVIDIA_DRIVER_CAPABILITIES=all 2024-08-20T23:30:39.8807216Z DOCKER_CONTAINER_ID: 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:39.8807855Z ##[endgroup] 2024-08-20T23:30:40.6566807Z 00fa8332bfd4 2024-08-20T23:30:43.3196673Z Deleted Containers: 2024-08-20T23:30:43.3197374Z 00fa8332bfd43a435ff056113a1f910a3cfb925b205ebffeda11baf2d1c7b3fe 2024-08-20T23:30:43.3197939Z 2024-08-20T23:30:53.1688538Z Deleted Images: 2024-08-20T23:30:53.1690175Z untagged: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9:f6d216893d65c7b8ae43df4daaf247db808378e9 2024-08-20T23:30:53.1692687Z untagged: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-cuda12.4-cudnn9-py3-gcc9@sha256:b9572efb5db3e0afffd147f045c42cead9b20264103a375915f77858de439a1d 2024-08-20T23:30:53.1694497Z deleted: sha256:0e10e8f27e667bb8effbc6011be2637bcca6e8a882c60bd12fee3a26ef6dcc2e 2024-08-20T23:30:53.1695621Z deleted: sha256:2884358cf578f8e1775702266e9f746225ebc90eed6dc34a4187b91c75acf350 2024-08-20T23:30:53.1696499Z deleted: sha256:2dcf7c862145e95cbbff126e6cd6e839698dc988b003206759f900b124698afd 2024-08-20T23:30:53.1697305Z deleted: sha256:477a530347043590e4341c4df924ccc549b9dd58be35fe99c8d3c29c8a7afe03 2024-08-20T23:30:53.1698127Z deleted: sha256:065c07f50bc5e74cbfe54da8841275a957d4d78ab39bd83835fa020d15cf7501 2024-08-20T23:30:53.1698961Z deleted: sha256:390cbb94dbc1ee8ef4f729fd9dda11a45d402f4510171095139651a9750c7949 2024-08-20T23:30:53.1699772Z deleted: sha256:905287e3135508aad6a220d5e88a76d6c14a33f4e0a5dd83afa8d67a7b0d4610 2024-08-20T23:30:53.1700941Z deleted: sha256:ecb0d1edf06d610902a5a654e45e535e821f8601074d6f3a30e1b69a8f8351ca 2024-08-20T23:30:53.1702017Z deleted: sha256:aab676b9246bbb7c8c7ddc2af9a6d0fd87af671d79103bda7b7d0142806f9c0f 2024-08-20T23:30:53.1703042Z deleted: sha256:d05b3deebb9abbd9c5c291ad910dbad81b3713aadf270ddc523575b69c04ebbd 2024-08-20T23:30:53.1703896Z deleted: sha256:9743f48a5b924661067248560021b71d3541a89ffff8517fb75abf2974d54fe1 2024-08-20T23:30:53.1704695Z deleted: sha256:d8fc2670e810f0396684899b6749274bd97731333ffbc0a8276bfe084800e8f9 2024-08-20T23:30:53.1705524Z deleted: sha256:62d5679fcaea02cc04e9be2fa8376cebef48bc040adadc7955894b41c46b7b13 2024-08-20T23:30:53.1706355Z deleted: sha256:1f7cb1d23b8f99a733216f68c2de37d3ddcc62e3733593f0df4dd641c5787cfe 2024-08-20T23:30:53.1707166Z deleted: sha256:35f7a12102b023ef2f796612c6b05618bd27de611193529494b0446f1bb3e163 2024-08-20T23:30:53.1707966Z deleted: sha256:712b8587f737083255ea8818a7acc522cf09033945fedc38eb7873ade2160651 2024-08-20T23:30:53.1708774Z deleted: sha256:c82ec77f35f0e3069f26580872687c9fc3f4d2c3318ade8f6cc740d969c92134 2024-08-20T23:30:53.1709588Z deleted: sha256:7ea91df87e39d2a4c76ba556117492657dd60a3850412ffd62ee428130e40038 2024-08-20T23:30:53.1710507Z deleted: sha256:69b1f132579b17a784fbf8509e29204d9a363548042cbe7e599e45662a2fc547 2024-08-20T23:30:53.1711592Z deleted: sha256:94ac72d1753b40e73617ddfeb578fbebc8604f0bab1208f7d2e87e8f42243e7e 2024-08-20T23:30:53.1712423Z deleted: sha256:98ef5f96999dc26d13adcff596555978a33b7222b80fa5238c6e89b8d279a0ee 2024-08-20T23:30:53.1713324Z deleted: sha256:3f6c3e8f5fed7b29f753604e1316cd34d922e5365e9b29931810a43cd6485db6 2024-08-20T23:30:53.1714206Z deleted: sha256:59561360a00be3102ef8d8f21946769085910e1d9c06ab71f7116d2aadb8ec25 2024-08-20T23:30:53.1714995Z deleted: sha256:9d14eb17507155b6a9d300596b6eabaf54fe454842d7e3ac763729c42fd41fa9 2024-08-20T23:30:53.1715818Z deleted: sha256:62dec881683121b16ffe4bb25f2f61de35da7ca45e9ebddea7a2d7721c510f94 2024-08-20T23:30:53.1716647Z deleted: sha256:b263d7d48d0bb66cea06b4d3824d22ff2973609df2fa8d2b5ae2f8108cd1565b 2024-08-20T23:30:53.1717470Z deleted: sha256:5cdb74e88e1bd2ec4ee2e6443211bc57efa9c4242d1e9d4462360106c1e55394 2024-08-20T23:30:53.1718295Z deleted: sha256:dba128a1651a9922658c94b3ac716ec557dfea4d2f96b36561ea76c67a6158e2 2024-08-20T23:30:53.1719088Z deleted: sha256:fff8178147779ce7efb82577b221c44032bb5362507251e45749c0214a5a939b 2024-08-20T23:30:53.1719969Z deleted: sha256:da15a74b1a9a522f507b0428f08350a885c65c4178e01babf17dc79e496b9674 2024-08-20T23:30:53.1720765Z deleted: sha256:dfcdc2d67653c2596c578247a3779ac961f1701eeec6c5b85b69cd02afe2a49d 2024-08-20T23:30:53.1721564Z deleted: sha256:8a9258b84a8824b23aa471e31479dd49bd892da4f79351c4a7a72b88529a3ed5 2024-08-20T23:30:53.1722372Z deleted: sha256:b2ff50c122f92c4a6ff4eccd9f8622a6922bdde87c13f167c2c768136bd3d2fa 2024-08-20T23:30:53.1723174Z deleted: sha256:409965bf423e009ad23cae2db88de8b950633f84948b8e196c09598ff6415bb5 2024-08-20T23:30:53.1723966Z deleted: sha256:3076c99d9c7598845b2877b1398988e6501994ba9c6ce32a9cf93f771ce6ad7d 2024-08-20T23:30:53.1724767Z deleted: sha256:ed66a53ed53687e750f87b1ab915bec76e1e9c47cf9265c61fe187ab235fecf8 2024-08-20T23:30:53.1725619Z deleted: sha256:4130fe3b95d8755b5d13214b92dce4f011c11dc54102af6a6621b8c72b53dccd 2024-08-20T23:30:53.1726431Z deleted: sha256:0ed25b448a83a2ba4dd267eab5ef897a8eaf0bb65cccc1c51333f1d606b22107 2024-08-20T23:30:53.1727280Z deleted: sha256:2c65da2fd78a0629d2fcb34961b01c3aa991572c0f226845a4f7605c321138e3 2024-08-20T23:30:53.1728093Z deleted: sha256:b3a7fd872d9f4e244c032b56a24717df7e1e5f8ecfd7731837dd593e810cee40 2024-08-20T23:30:53.1728884Z deleted: sha256:43d0920e05f176a5aeb1f620f157b8342b05f055ec700101f9981535f94feb5a 2024-08-20T23:30:53.1729680Z deleted: sha256:1ce3adfb34a4c3aae10bc94c0088782afc6f36982e3029277066199fedd93ed2 2024-08-20T23:30:53.1730472Z deleted: sha256:83c6543469526f163e772400878f6f8897a2780dcafbff2669b0bdf5ae970503 2024-08-20T23:30:53.1731254Z deleted: sha256:5cf3d30368614aa1f8a0ef306b2e2237b4541bd98a8d2c73f75ed64adc242f75 2024-08-20T23:30:53.1732219Z deleted: sha256:dc9f273265be80d0e82ec74e3633f62358c37af364ef42b4f9b31b3d64e8c367 2024-08-20T23:30:53.1733117Z deleted: sha256:3921afa75c295ac1918fe8a9da2f377b78e6e0bf28abffffe1a823af4b27ad91 2024-08-20T23:30:53.1733949Z deleted: sha256:8a1bbfef49d93f4aa9cc6d69a66e34a0a95392a29c989dbf7e2eb40bbc539da5 2024-08-20T23:30:53.1734768Z deleted: sha256:1d2b2dcae0a730bfb7993f54f86cb2dfc8309935c23f86e15e6287285d2a5be3 2024-08-20T23:30:53.1735575Z deleted: sha256:21769132a7722c8b89315d0eede01c67c2f8ca4ffbe2fb481070f49e5dc6c4fb 2024-08-20T23:30:53.1736375Z deleted: sha256:d101a771d49279a3ca92ec6730a0fb10d256769c4ae8c4a66fdc66790942a51f 2024-08-20T23:30:53.1737169Z deleted: sha256:49704855a4abfa48c4cd293181f59759234c10c3e5a3b3598229261e3f9500f8 2024-08-20T23:30:53.1737990Z deleted: sha256:1c5df74f67b117e3cd59d2085d4e285814e67977a4b1dba38980237c090425b6 2024-08-20T23:30:53.1739059Z deleted: sha256:a9ba1a618d14c895c3f08485813ecb22ead621d79e164553f9b51e7ca781ace0 2024-08-20T23:30:53.1740089Z deleted: sha256:0d99196884c41ab7258518026966324210397a0649ce66ebebfdc2d2c197b215 2024-08-20T23:30:53.1740892Z deleted: sha256:f03fd4500632bdb922622e51db4a2f391a19ed96856958fe8b2089a6caf29b1f 2024-08-20T23:30:53.1741704Z deleted: sha256:5457343c72debeafa51c6b95547a3a7d3c4f877a54a910bd88fe7d3217e0bf59 2024-08-20T23:30:53.1742516Z deleted: sha256:f2346598f9e99e5fa862c22abc5850ed168cd8c5753e5cdc4d30da1efdbef784 2024-08-20T23:30:53.1743307Z deleted: sha256:362c6e854f200b8f3370a218506ef497c5425e788bedf6409daf488a529aaede 2024-08-20T23:30:53.1744108Z deleted: sha256:217e6c9c5754a8e01c7f4a5aa6930dbd88966f2d52edee67bfcced6aed26d3d8 2024-08-20T23:30:53.1744924Z deleted: sha256:4d24bfb5f687afd5cd27546991be3d18bd7973b4ef6ac780de6f42d57f7eec72 2024-08-20T23:30:53.1745745Z deleted: sha256:ebeb9d805f5bbe32b4b90f9925f27abf78d21f00c6db3aae76ca97b8fe24d162 2024-08-20T23:30:53.1746570Z deleted: sha256:0c8a0f170fed1c564195f08bafdfa5f4e46a14dd74b851f15aad93172eeeb15d 2024-08-20T23:30:53.1747391Z deleted: sha256:13f2c977fc1729a9995604c0dfde376b7ff4d5be1cf1c5fbd8cc55c17cb0587f 2024-08-20T23:30:53.1748205Z deleted: sha256:f7a7bdf06d601a452b8949fcd84776d0ef3528dee4c4fb6b208ae1dd496e4b80 2024-08-20T23:30:53.1749029Z deleted: sha256:6668137610488b1cbb16bb60df3718777955eefb6c585fc1f35aec14a9c56539 2024-08-20T23:30:53.1749835Z deleted: sha256:55a4693dee36c4d323f2c933bc5a1f1e2d9d3beb987d27f69605715448e6850c 2024-08-20T23:30:53.1750653Z deleted: sha256:bc8844fce61c35cdb9f847d96b5ffcf18c111715b14eb84205e727b133e1a584 2024-08-20T23:30:53.1751458Z deleted: sha256:9c91d56fae1b3de9d0509b3340050333ae35d2794089098c7032ebf768fcba15 2024-08-20T23:30:53.1752261Z deleted: sha256:433544e4353c9e0269dc71fa16c00ddcecdcd5676ed0df618c7b51ac78e23053 2024-08-20T23:30:53.1753091Z deleted: sha256:fc24af9fd5121adfbc7b75df0abb1ef8c2d3868f33b7b6b1e8bf4e214fac4bd6 2024-08-20T23:30:53.1753902Z deleted: sha256:ce1492ad94b71204f8b3fb8903d0ad48a96652635e7fc9347e430639ca15a737 2024-08-20T23:30:53.1754698Z deleted: sha256:96f2ae9435e6e12f68a88dc404c8079b2796639589970c34894a86ccff3f3732 2024-08-20T23:30:53.1755566Z deleted: sha256:5310d2bc0952e7c6d8cbade7ecafd88ce3a04d9da303fa6419583e13a02b58a9 2024-08-20T23:30:53.1756376Z deleted: sha256:099076d608a55a027d60f5b572a3991a8d690a01500acf2c696933a7e6d38650 2024-08-20T23:30:53.1757178Z deleted: sha256:862b05d0f8f7e3dff2d04bd49784a9901f4e22c5dd6544ae72eb3be05e23fffe 2024-08-20T23:30:53.1757988Z deleted: sha256:f7a888e5f7904827f6d71cebcba5169dbc7a78b20f228b9b9b625f38e0c52f24 2024-08-20T23:30:53.1758803Z deleted: sha256:ab9f75a4a7a636aae480f630eaddc05133dda64f1c42445b617c97d3d209991c 2024-08-20T23:30:53.1759758Z deleted: sha256:56dda0c9cee64db5c10d5dd5f08c6f6707263f5ee98e8a075d635e5be996e855 2024-08-20T23:30:53.1760593Z deleted: sha256:5faf9c0a9efe4675ecd21a4ec417d51077d5e75da9e673161a94e7d6cd43f92c 2024-08-20T23:30:53.1761085Z 2024-08-20T23:30:53.1761246Z Total reclaimed space: 34.25GB 2024-08-20T23:30:53.1828631Z Post job cleanup. 2024-08-20T23:30:53.1879513Z Post job cleanup. 2024-08-20T23:30:53.2762720Z [command]/usr/bin/git version 2024-08-20T23:30:53.2828593Z git version 2.40.1 2024-08-20T23:30:53.2866161Z Temporarily overriding HOME='/home/ec2-user/actions-runner/_work/_temp/8be961d0-325c-4fa6-a36a-f9ea0fce4fa7' before making global git config changes 2024-08-20T23:30:53.2867544Z Adding repository directory to the temporary git global config as a safe directory 2024-08-20T23:30:53.2871902Z [command]/usr/bin/git config --global --add safe.directory /home/ec2-user/actions-runner/_work/pytorch/pytorch 2024-08-20T23:30:53.2919771Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2024-08-20T23:30:53.2964074Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2024-08-20T23:30:53.3349105Z Entering 'android/libs/fbjni' 2024-08-20T23:30:53.3420227Z Entering 'third_party/FP16' 2024-08-20T23:30:53.3489289Z Entering 'third_party/FXdiv' 2024-08-20T23:30:53.3558523Z Entering 'third_party/NNPACK' 2024-08-20T23:30:53.3627888Z Entering 'third_party/VulkanMemoryAllocator' 2024-08-20T23:30:53.3696759Z Entering 'third_party/XNNPACK' 2024-08-20T23:30:53.3780147Z Entering 'third_party/benchmark' 2024-08-20T23:30:53.3848821Z Entering 'third_party/cpp-httplib' 2024-08-20T23:30:53.3920524Z Entering 'third_party/cpuinfo' 2024-08-20T23:30:53.3989571Z Entering 'third_party/cudnn_frontend' 2024-08-20T23:30:53.4060942Z Entering 'third_party/cutlass' 2024-08-20T23:30:53.4137228Z Entering 'third_party/eigen' 2024-08-20T23:30:53.4207052Z Entering 'third_party/fbgemm' 2024-08-20T23:30:53.4275309Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-08-20T23:30:53.4346011Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-08-20T23:30:53.4412725Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-08-20T23:30:53.4485695Z Entering 'third_party/fbgemm/third_party/googletest' 2024-08-20T23:30:53.4551099Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-08-20T23:30:53.4619313Z Entering 'third_party/flatbuffers' 2024-08-20T23:30:53.4692279Z Entering 'third_party/fmt' 2024-08-20T23:30:53.4760083Z Entering 'third_party/gemmlowp/gemmlowp' 2024-08-20T23:30:53.4828570Z Entering 'third_party/gloo' 2024-08-20T23:30:53.4897094Z Entering 'third_party/googletest' 2024-08-20T23:30:53.4967813Z Entering 'third_party/ideep' 2024-08-20T23:30:53.5034134Z Entering 'third_party/ideep/mkl-dnn' 2024-08-20T23:30:53.5109551Z Entering 'third_party/ittapi' 2024-08-20T23:30:53.5179790Z Entering 'third_party/kineto' 2024-08-20T23:30:53.5248159Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-08-20T23:30:53.5312924Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-08-20T23:30:53.5391426Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-08-20T23:30:53.5460290Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-08-20T23:30:53.5531643Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-08-20T23:30:53.5597583Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-08-20T23:30:53.5669898Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-08-20T23:30:53.5737826Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-08-20T23:30:53.5806567Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-08-20T23:30:53.5876117Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-08-20T23:30:53.5948288Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-08-20T23:30:53.6018958Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-08-20T23:30:53.6092390Z Entering 'third_party/mimalloc' 2024-08-20T23:30:53.6162327Z Entering 'third_party/nccl/nccl' 2024-08-20T23:30:53.6232490Z Entering 'third_party/nlohmann' 2024-08-20T23:30:53.6302591Z Entering 'third_party/onnx' 2024-08-20T23:30:53.6386903Z Entering 'third_party/onnx/third_party/benchmark' 2024-08-20T23:30:53.6454015Z Entering 'third_party/onnx/third_party/pybind11' 2024-08-20T23:30:53.6526342Z Entering 'third_party/opentelemetry-cpp' 2024-08-20T23:30:53.6595739Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-08-20T23:30:53.6666298Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-08-20T23:30:53.6732743Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-08-20T23:30:53.6798685Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-08-20T23:30:53.6873578Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-08-20T23:30:53.6939946Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-08-20T23:30:53.7006308Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-08-20T23:30:53.7070200Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-08-20T23:30:53.7140481Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-08-20T23:30:53.7215277Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-08-20T23:30:53.7307299Z Entering 'third_party/pocketfft' 2024-08-20T23:30:53.7377247Z Entering 'third_party/protobuf' 2024-08-20T23:30:53.7448259Z Entering 'third_party/protobuf/third_party/benchmark' 2024-08-20T23:30:53.7514750Z Entering 'third_party/protobuf/third_party/googletest' 2024-08-20T23:30:53.7584504Z Entering 'third_party/psimd' 2024-08-20T23:30:53.7654100Z Entering 'third_party/pthreadpool' 2024-08-20T23:30:53.7723577Z Entering 'third_party/pybind11' 2024-08-20T23:30:53.7793652Z Entering 'third_party/python-peachpy' 2024-08-20T23:30:53.7863394Z Entering 'third_party/sleef' 2024-08-20T23:30:53.7933269Z Entering 'third_party/tensorpipe' 2024-08-20T23:30:53.8001979Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-08-20T23:30:53.8071204Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-08-20T23:30:53.8139072Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-08-20T23:30:53.8205306Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-08-20T23:30:53.8278350Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-08-20T23:30:53.8376838Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2024-08-20T23:30:53.8413029Z http.https://github.com/.extraheader 2024-08-20T23:30:53.8423006Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2024-08-20T23:30:53.8468272Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2024-08-20T23:30:53.8838619Z Entering 'android/libs/fbjni' 2024-08-20T23:30:53.8885928Z http.https://github.com/.extraheader 2024-08-20T23:30:53.8930152Z Entering 'third_party/FP16' 2024-08-20T23:30:53.8977002Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9021131Z Entering 'third_party/FXdiv' 2024-08-20T23:30:53.9065444Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9109791Z Entering 'third_party/NNPACK' 2024-08-20T23:30:53.9154095Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9198870Z Entering 'third_party/VulkanMemoryAllocator' 2024-08-20T23:30:53.9243052Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9287178Z Entering 'third_party/XNNPACK' 2024-08-20T23:30:53.9330408Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9391024Z Entering 'third_party/benchmark' 2024-08-20T23:30:53.9434417Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9478278Z Entering 'third_party/cpp-httplib' 2024-08-20T23:30:53.9521619Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9565187Z Entering 'third_party/cpuinfo' 2024-08-20T23:30:53.9608617Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9652785Z Entering 'third_party/cudnn_frontend' 2024-08-20T23:30:53.9696116Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9740215Z Entering 'third_party/cutlass' 2024-08-20T23:30:53.9785538Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9836919Z Entering 'third_party/eigen' 2024-08-20T23:30:53.9880932Z http.https://github.com/.extraheader 2024-08-20T23:30:53.9925841Z Entering 'third_party/fbgemm' 2024-08-20T23:30:53.9969749Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0015105Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-08-20T23:30:54.0058500Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0100764Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-08-20T23:30:54.0144654Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0190635Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-08-20T23:30:54.0233117Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0283248Z Entering 'third_party/fbgemm/third_party/googletest' 2024-08-20T23:30:54.0326149Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0369347Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-08-20T23:30:54.0412145Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0459440Z Entering 'third_party/flatbuffers' 2024-08-20T23:30:54.0505434Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0552038Z Entering 'third_party/fmt' 2024-08-20T23:30:54.0598096Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0642341Z Entering 'third_party/gemmlowp/gemmlowp' 2024-08-20T23:30:54.0687282Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0732210Z Entering 'third_party/gloo' 2024-08-20T23:30:54.0781396Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0824290Z Entering 'third_party/googletest' 2024-08-20T23:30:54.0873208Z http.https://github.com/.extraheader 2024-08-20T23:30:54.0916568Z Entering 'third_party/ideep' 2024-08-20T23:30:54.0960149Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1002225Z Entering 'third_party/ideep/mkl-dnn' 2024-08-20T23:30:54.1044897Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1099510Z Entering 'third_party/ittapi' 2024-08-20T23:30:54.1142622Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1186262Z Entering 'third_party/kineto' 2024-08-20T23:30:54.1231463Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1274549Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-08-20T23:30:54.1318130Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1360666Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-08-20T23:30:54.1404909Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1451240Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-08-20T23:30:54.1495277Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1539504Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-08-20T23:30:54.1582714Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1627219Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-08-20T23:30:54.1672216Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1715093Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-08-20T23:30:54.1758998Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1807388Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-08-20T23:30:54.1850258Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1896046Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-08-20T23:30:54.1943432Z http.https://github.com/.extraheader 2024-08-20T23:30:54.1988235Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-08-20T23:30:54.2037935Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2083407Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-08-20T23:30:54.2126763Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2174585Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-08-20T23:30:54.2216901Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2260220Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-08-20T23:30:54.2301762Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2348103Z Entering 'third_party/mimalloc' 2024-08-20T23:30:54.2395184Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2443779Z Entering 'third_party/nccl/nccl' 2024-08-20T23:30:54.2488662Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2532704Z Entering 'third_party/nlohmann' 2024-08-20T23:30:54.2576887Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2621651Z Entering 'third_party/onnx' 2024-08-20T23:30:54.2665580Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2726934Z Entering 'third_party/onnx/third_party/benchmark' 2024-08-20T23:30:54.2770913Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2814881Z Entering 'third_party/onnx/third_party/pybind11' 2024-08-20T23:30:54.2858382Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2907863Z Entering 'third_party/opentelemetry-cpp' 2024-08-20T23:30:54.2951318Z http.https://github.com/.extraheader 2024-08-20T23:30:54.2997137Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-08-20T23:30:54.3039921Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3083225Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-08-20T23:30:54.3126206Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3169824Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-08-20T23:30:54.3212553Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3256179Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-08-20T23:30:54.3298653Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3343675Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-08-20T23:30:54.3386999Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3430469Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-08-20T23:30:54.3477825Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3520271Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-08-20T23:30:54.3563038Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3606165Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-08-20T23:30:54.3650092Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3697303Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-08-20T23:30:54.3740160Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3788220Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-08-20T23:30:54.3831463Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3897193Z Entering 'third_party/pocketfft' 2024-08-20T23:30:54.3941546Z http.https://github.com/.extraheader 2024-08-20T23:30:54.3985855Z Entering 'third_party/protobuf' 2024-08-20T23:30:54.4034279Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4080991Z Entering 'third_party/protobuf/third_party/benchmark' 2024-08-20T23:30:54.4124147Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4167173Z Entering 'third_party/protobuf/third_party/googletest' 2024-08-20T23:30:54.4209534Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4255564Z Entering 'third_party/psimd' 2024-08-20T23:30:54.4300515Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4344615Z Entering 'third_party/pthreadpool' 2024-08-20T23:30:54.4389349Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4433302Z Entering 'third_party/pybind11' 2024-08-20T23:30:54.4478726Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4523221Z Entering 'third_party/python-peachpy' 2024-08-20T23:30:54.4567583Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4611901Z Entering 'third_party/sleef' 2024-08-20T23:30:54.4655224Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4700428Z Entering 'third_party/tensorpipe' 2024-08-20T23:30:54.4743709Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4787393Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-08-20T23:30:54.4834458Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4877553Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-08-20T23:30:54.4925396Z http.https://github.com/.extraheader 2024-08-20T23:30:54.4968610Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-08-20T23:30:54.5011199Z http.https://github.com/.extraheader 2024-08-20T23:30:54.5054675Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-08-20T23:30:54.5097647Z http.https://github.com/.extraheader 2024-08-20T23:30:54.5138728Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-08-20T23:30:54.5181327Z http.https://github.com/.extraheader 2024-08-20T23:30:54.5341830Z A job completed hook has been configured by the self-hosted runner administrator 2024-08-20T23:30:54.5364287Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2024-08-20T23:30:54.5372837Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-08-20T23:30:54.5373354Z ##[endgroup] 2024-08-20T23:31:01.4664391Z Cleaning up orphan processes