2024-04-03T11:36:13.2562900Z ##[debug]Starting: linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:36:13.2593421Z ##[debug]Cleaning runner temp folder: /home/pytorchci/actions-runner/_work/_temp 2024-04-03T11:36:13.3082644Z ##[debug]Starting: Set up job 2024-04-03T11:36:13.3083324Z Current runner version: '2.314.1' 2024-04-03T11:36:13.3086738Z Runner name: 'worker-rocm-amd-28' 2024-04-03T11:36:13.3087596Z Runner group name: 'linux.rocm.gpu.group' 2024-04-03T11:36:13.3088720Z Machine name: 'jenkins-worker-rocm-amd-28' 2024-04-03T11:36:13.3092031Z ##[group]GITHUB_TOKEN Permissions 2024-04-03T11:36:13.3093821Z Contents: read 2024-04-03T11:36:13.3094378Z Metadata: read 2024-04-03T11:36:13.3094857Z ##[endgroup] 2024-04-03T11:36:13.3098267Z Secret source: Actions 2024-04-03T11:36:13.3099084Z ##[debug]Primary repository: pytorch/pytorch 2024-04-03T11:36:13.3100183Z Prepare workflow directory 2024-04-03T11:36:13.3197670Z ##[debug]Update context data 2024-04-03T11:36:13.3202092Z ##[debug]Evaluating job-level environment variables 2024-04-03T11:36:13.4121439Z ##[debug]Evaluating: github.event.repository.default_branch 2024-04-03T11:36:13.4128050Z ##[debug]Evaluating Index: 2024-04-03T11:36:13.4130632Z ##[debug]..Evaluating Index: 2024-04-03T11:36:13.4131282Z ##[debug]....Evaluating Index: 2024-04-03T11:36:13.4131940Z ##[debug]......Evaluating github: 2024-04-03T11:36:13.4148706Z ##[debug]......=> Object 2024-04-03T11:36:13.4165871Z ##[debug]......Evaluating String: 2024-04-03T11:36:13.4166993Z ##[debug]......=> 'event' 2024-04-03T11:36:13.4171411Z ##[debug]....=> Object 2024-04-03T11:36:13.4172257Z ##[debug]....Evaluating String: 2024-04-03T11:36:13.4172947Z ##[debug]....=> 'repository' 2024-04-03T11:36:13.4173741Z ##[debug]..=> Object 2024-04-03T11:36:13.4174315Z ##[debug]..Evaluating String: 2024-04-03T11:36:13.4174926Z ##[debug]..=> 'default_branch' 2024-04-03T11:36:13.4176188Z ##[debug]=> 'main' 2024-04-03T11:36:13.4179304Z ##[debug]Result: 
'main' 2024-04-03T11:36:13.4217407Z ##[debug]Evaluating job container 2024-04-03T11:36:13.4220879Z ##[debug]Evaluating job service containers 2024-04-03T11:36:13.4223535Z ##[debug]Evaluating job defaults 2024-04-03T11:36:13.8437968Z Prepare all required actions 2024-04-03T11:36:13.8722872Z Getting action download info 2024-04-03T11:36:14.3373435Z Download action repository 'pytorch/pytorch@main' (SHA:957b8d5c0098bdc43468b8e68659db3183dc06cf) 2024-04-03T11:36:18.8359031Z ##[debug]Download 'https://api.github.com/repos/pytorch/pytorch/tarball/957b8d5c0098bdc43468b8e68659db3183dc06cf' to '/home/pytorchci/actions-runner/_work/_actions/_temp_84789626-2473-4a9c-8911-6fed3a78c7f9/2dc3cf6d-1105-4a49-907d-1727c91843bb.tar.gz' 2024-04-03T11:36:20.5583695Z ##[debug]Unwrap 'pytorch-pytorch-957b8d5' to '/home/pytorchci/actions-runner/_work/_actions/pytorch/pytorch/main' 2024-04-03T11:36:22.0255592Z ##[debug]Archive '/home/pytorchci/actions-runner/_work/_actions/_temp_84789626-2473-4a9c-8911-6fed3a78c7f9/2dc3cf6d-1105-4a49-907d-1727c91843bb.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/pytorch/pytorch/main'. 
2024-04-03T11:36:22.3815388Z Download action repository 'aws-actions/configure-aws-credentials@v4' (SHA:e3dd6a429d7300a6a4c196c26e071d42e0343502) 2024-04-03T11:36:23.3142683Z ##[debug]Download 'https://api.github.com/repos/aws-actions/configure-aws-credentials/tarball/e3dd6a429d7300a6a4c196c26e071d42e0343502' to '/home/pytorchci/actions-runner/_work/_actions/_temp_c66108d8-0354-462f-b87a-5336e27c9c84/84d26581-ac6f-4cd8-b807-06746fe854b3.tar.gz' 2024-04-03T11:36:23.3552822Z ##[debug]Unwrap 'aws-actions-configure-aws-credentials-e3dd6a4' to '/home/pytorchci/actions-runner/_work/_actions/aws-actions/configure-aws-credentials/v4' 2024-04-03T11:36:23.3640533Z ##[debug]Archive '/home/pytorchci/actions-runner/_work/_actions/_temp_c66108d8-0354-462f-b87a-5336e27c9c84/84d26581-ac6f-4cd8-b807-06746fe854b3.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/aws-actions/configure-aws-credentials/v4'. 2024-04-03T11:36:23.3716546Z Download action repository 'aws-actions/amazon-ecr-login@v2' (SHA:062b18b96a7aff071d4dc91bc00c4c1a7945b076) 2024-04-03T11:36:24.2392245Z ##[debug]Download 'https://api.github.com/repos/aws-actions/amazon-ecr-login/tarball/062b18b96a7aff071d4dc91bc00c4c1a7945b076' to '/home/pytorchci/actions-runner/_work/_actions/_temp_ca6ae35b-653f-40e8-8300-7b95d42a3436/d8035da0-7b85-4c8a-99f4-1d7a2c13eb2e.tar.gz' 2024-04-03T11:36:24.2710696Z ##[debug]Unwrap 'aws-actions-amazon-ecr-login-062b18b' to '/home/pytorchci/actions-runner/_work/_actions/aws-actions/amazon-ecr-login/v2' 2024-04-03T11:36:24.2805308Z ##[debug]Archive '/home/pytorchci/actions-runner/_work/_actions/_temp_ca6ae35b-653f-40e8-8300-7b95d42a3436/d8035da0-7b85-4c8a-99f4-1d7a2c13eb2e.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/aws-actions/amazon-ecr-login/v2'. 
2024-04-03T11:36:24.2856336Z Download action repository 'pytorch/test-infra@main' (SHA:15d02d5046abe95e7d5eeeac8c2bd9d965e6f567) 2024-04-03T11:36:25.3836023Z ##[debug]Download 'https://api.github.com/repos/pytorch/test-infra/tarball/15d02d5046abe95e7d5eeeac8c2bd9d965e6f567' to '/home/pytorchci/actions-runner/_work/_actions/_temp_d8e6cbf4-fb8d-42f6-b940-279875b550d5/c884d06d-fc66-4cc5-9511-c40b88ba030c.tar.gz' 2024-04-03T11:36:25.4770101Z ##[debug]Unwrap 'pytorch-test-infra-15d02d5' to '/home/pytorchci/actions-runner/_work/_actions/pytorch/test-infra/main' 2024-04-03T11:36:25.5684316Z ##[debug]Archive '/home/pytorchci/actions-runner/_work/_actions/_temp_d8e6cbf4-fb8d-42f6-b940-279875b550d5/c884d06d-fc66-4cc5-9511-c40b88ba030c.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/pytorch/test-infra/main'. 2024-04-03T11:36:25.6178461Z ##[debug]Action 'pytorch/test-infra@main' already downloaded at '/home/pytorchci/actions-runner/_work/_actions/pytorch/test-infra/main'. 2024-04-03T11:36:25.6189732Z Download action repository 'actions/upload-artifact@v3' (SHA:a8a3f3ad30e3422c9c7b888a15615d19a852ae32) 2024-04-03T11:36:26.4817042Z ##[debug]Download 'https://api.github.com/repos/actions/upload-artifact/tarball/a8a3f3ad30e3422c9c7b888a15615d19a852ae32' to '/home/pytorchci/actions-runner/_work/_actions/_temp_ce03e77e-ba06-4135-b8e0-18c47cd030ca/f9adf2be-10a7-4464-972a-17fd4c7cf508.tar.gz' 2024-04-03T11:36:26.5065620Z ##[debug]Unwrap 'actions-upload-artifact-a8a3f3a' to '/home/pytorchci/actions-runner/_work/_actions/actions/upload-artifact/v3' 2024-04-03T11:36:26.5173619Z ##[debug]Archive '/home/pytorchci/actions-runner/_work/_actions/_temp_ce03e77e-ba06-4135-b8e0-18c47cd030ca/f9adf2be-10a7-4464-972a-17fd4c7cf508.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/actions/upload-artifact/v3'. 
2024-04-03T11:36:26.5340052Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/pytorch/pytorch/main/.github/actions/checkout-pytorch/action.yml'. 2024-04-03T11:36:26.6159470Z Getting action download info 2024-04-03T11:36:26.7725761Z Download action repository 'malfet/checkout@silent-checkout' (SHA:e07af140b3ccefc05679e3755b9db68f4ee4589c) 2024-04-03T11:36:27.7248694Z ##[debug]Download 'https://api.github.com/repos/malfet/checkout/tarball/e07af140b3ccefc05679e3755b9db68f4ee4589c' to '/home/pytorchci/actions-runner/_work/_actions/_temp_0f9d368e-937e-4465-8d38-4e53295a8e53/32e4a2df-c2c3-4da9-95e7-a21ed4f76d38.tar.gz' 2024-04-03T11:36:27.7579955Z ##[debug]Unwrap 'malfet-checkout-e07af14' to '/home/pytorchci/actions-runner/_work/_actions/malfet/checkout/silent-checkout' 2024-04-03T11:36:27.7736054Z ##[debug]Archive '/home/pytorchci/actions-runner/_work/_actions/_temp_0f9d368e-937e-4465-8d38-4e53295a8e53/32e4a2df-c2c3-4da9-95e7-a21ed4f76d38.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/malfet/checkout/silent-checkout'. 2024-04-03T11:36:27.7805101Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/malfet/checkout/silent-checkout/action.yml'. 2024-04-03T11:36:27.8214618Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/aws-actions/configure-aws-credentials/v4/action.yml'. 2024-04-03T11:36:27.8288431Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/aws-actions/amazon-ecr-login/v2/action.yml'. 2024-04-03T11:36:27.8325433Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/pytorch/test-infra/main/.github/actions/calculate-docker-image/action.yml'. 2024-04-03T11:36:27.8572247Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/pytorch/test-infra/main/.github/actions/pull-docker-image/action.yml'. 
2024-04-03T11:36:27.8621139Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/actions/upload-artifact/v3/action.yml'. 2024-04-03T11:36:27.8716083Z ##[debug]Set step '__pytorch_pytorch' display name to: 'Checkout PyTorch' 2024-04-03T11:36:27.8718765Z ##[debug]Set step '__self' display name to: 'Setup ROCm' 2024-04-03T11:36:27.8720277Z ##[debug]Set step 'aws_creds' display name to: 'configure aws credentials' 2024-04-03T11:36:27.8721852Z ##[debug]Set step 'login-ecr' display name to: 'Login to Amazon ECR' 2024-04-03T11:36:27.8723471Z ##[debug]Set step 'calculate-docker-image' display name to: 'Calculate docker image' 2024-04-03T11:36:27.8725128Z ##[debug]Set step '__pytorch_test-infra' display name to: 'Pull docker image' 2024-04-03T11:36:27.8726745Z ##[debug]Set step 'monitor-script' display name to: 'Start monitoring script' 2024-04-03T11:36:27.8728312Z ##[debug]Set step '__self_2' display name to: 'Download build artifacts' 2024-04-03T11:36:27.8729802Z ##[debug]Set step '__self_3' display name to: 'Download TD artifacts' 2024-04-03T11:36:27.8731244Z ##[debug]Set step 'parse-ref' display name to: 'Parse ref' 2024-04-03T11:36:27.8732679Z ##[debug]Set step 'get-job-id' display name to: 'Get workflow job id' 2024-04-03T11:36:27.8734500Z ##[debug]Set step 'keep-going' display name to: 'Check for keep-going label and re-enabled test issues' 2024-04-03T11:36:27.8736216Z ##[debug]Set step 'test-timeout' display name to: 'Set Test step time' 2024-04-03T11:36:27.8737571Z ##[debug]Set step 'test' display name to: 'Test' 2024-04-03T11:36:27.8738906Z ##[debug]Set step '__run' display name to: 'Save test results' 2024-04-03T11:36:27.8740656Z ##[debug]Set step '__run_2' display name to: 'Print remaining test logs' 2024-04-03T11:36:27.8742170Z ##[debug]Set step '__run_3' display name to: 'Stop monitoring script' 2024-04-03T11:36:27.8743678Z ##[debug]Set step '__self_4' display name to: 'Upload test artifacts' 2024-04-03T11:36:27.8745316Z ##[debug]Set 
step '__run_4' display name to: 'Collect backtraces from coredumps (if any)' 2024-04-03T11:36:27.8747040Z ##[debug]Set step '__actions_upload-artifact' display name to: 'Store Core dumps on GitHub' 2024-04-03T11:36:27.8748618Z ##[debug]Set step '__self_5' display name to: 'Teardown ROCm' 2024-04-03T11:36:27.8751600Z Uses: pytorch/pytorch/.github/workflows/_rocm-test.yml@refs/tags/ciflow/trunk/123154 (d253bd7befc64137de913dbb8727b511f1052411) 2024-04-03T11:36:27.8754412Z ##[group] Inputs 2024-04-03T11:36:27.8754884Z build-environment: linux-focal-rocm6.0-py3.8 2024-04-03T11:36:27.8755753Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]} 2024-04-03T11:36:27.8757215Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:36:27.8758314Z sync-tag: 2024-04-03T11:36:27.8759213Z timeout-minutes: 300 2024-04-03T11:36:27.8760166Z tests-to-include: test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor 2024-04-03T11:36:27.8761190Z ##[endgroup] 2024-04-03T11:36:27.8762083Z Complete job name: linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:36:27.8839727Z ##[debug]Collect running processes for tracking orphan processes. 
2024-04-03T11:36:27.9588533Z ##[debug]Finishing: Set up job 2024-04-03T11:36:27.9841205Z ##[debug]Evaluating condition for step: 'Checkout PyTorch' 2024-04-03T11:36:27.9873494Z ##[debug]Evaluating: success() 2024-04-03T11:36:27.9875078Z ##[debug]Evaluating success: 2024-04-03T11:36:27.9887069Z ##[debug]=> true 2024-04-03T11:36:27.9889614Z ##[debug]Result: true 2024-04-03T11:36:27.9908164Z ##[debug]Starting: Checkout PyTorch 2024-04-03T11:36:28.0042191Z ##[debug]Register post job cleanup for action: pytorch/pytorch/.github/actions/checkout-pytorch@main 2024-04-03T11:36:28.0150019Z ##[debug]Loading inputs 2024-04-03T11:36:28.0157608Z ##[debug]Loading env 2024-04-03T11:36:28.0254997Z ##[group]Run pytorch/pytorch/.github/actions/checkout-pytorch@main 2024-04-03T11:36:28.0255673Z with: 2024-04-03T11:36:28.0255984Z no-sudo: true 2024-04-03T11:36:28.0256333Z submodules: recursive 2024-04-03T11:36:28.0256730Z fetch-depth: 0 2024-04-03T11:36:28.0257047Z env: 2024-04-03T11:36:28.0257367Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:36:28.0257761Z ##[endgroup] 2024-04-03T11:36:28.0456617Z ##[debug]Evaluating: inputs.no-sudo 2024-04-03T11:36:28.0457207Z ##[debug]Evaluating Index: 2024-04-03T11:36:28.0457673Z ##[debug]..Evaluating inputs: 2024-04-03T11:36:28.0458172Z ##[debug]..=> Object 2024-04-03T11:36:28.0458592Z ##[debug]..Evaluating String: 2024-04-03T11:36:28.0459034Z ##[debug]..=> 'no-sudo' 2024-04-03T11:36:28.0459553Z ##[debug]=> 'true' 2024-04-03T11:36:28.0460308Z ##[debug]Result: 'true' 2024-04-03T11:36:28.0460986Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:36:28.0462796Z ##[debug]Evaluating: success() 2024-04-03T11:36:28.0463410Z ##[debug]Evaluating success: 2024-04-03T11:36:28.0464638Z ##[debug]=> true 2024-04-03T11:36:28.0465289Z ##[debug]Result: true 2024-04-03T11:36:28.0476129Z ##[debug]Starting: run 2024-04-03T11:36:28.0489871Z ##[debug]Loading inputs 2024-04-03T11:36:28.0491091Z ##[debug]Loading env 2024-04-03T11:36:28.0534681Z ##[group]Run retry () 
{ 2024-04-03T11:36:28.0535133Z retry () { 2024-04-03T11:36:28.0535737Z  $* || (sleep 1 && $*) || (sleep 2 && $*) || (sleep 4 && $*) || (sleep 8 && $*) 2024-04-03T11:36:28.0536382Z } 2024-04-03T11:36:28.0536729Z echo "${GITHUB_WORKSPACE}" 2024-04-03T11:36:28.0537209Z if [ -z "${NO_SUDO}" ]; then 2024-04-03T11:36:28.0537749Z  retry sudo rm -rf "${GITHUB_WORKSPACE}" 2024-04-03T11:36:28.0538259Z else 2024-04-03T11:36:28.0538649Z  retry rm -rf "${GITHUB_WORKSPACE}" 2024-04-03T11:36:28.0539134Z fi 2024-04-03T11:36:28.0539471Z mkdir "${GITHUB_WORKSPACE}" 2024-04-03T11:36:28.0589856Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:36:28.0590445Z env: 2024-04-03T11:36:28.0590762Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:36:28.0591168Z NO_SUDO: true 2024-04-03T11:36:28.0591500Z ##[endgroup] 2024-04-03T11:36:28.0737029Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/53f3fbe7-4bba-4c22-86ba-c19f05ad1874.sh 2024-04-03T11:36:28.0839494Z /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:36:30.5825231Z ##[debug]Finished: run 2024-04-03T11:36:30.5832898Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:36:30.5834917Z ##[debug]Evaluating: success() 2024-04-03T11:36:30.5835534Z ##[debug]Evaluating success: 2024-04-03T11:36:30.5836216Z ##[debug]=> true 2024-04-03T11:36:30.5836798Z ##[debug]Result: true 2024-04-03T11:36:30.5837650Z ##[debug]Starting: run 2024-04-03T11:36:30.5870609Z ##[debug]Register post job cleanup for action: malfet/checkout@silent-checkout 2024-04-03T11:36:30.5883312Z ##[debug]Loading inputs 2024-04-03T11:36:30.5891261Z ##[debug]Evaluating: (((github.event_name == 'pull_request') && github.event.pull_request.head.sha) || github.sha) 2024-04-03T11:36:30.5892241Z ##[debug]Evaluating Or: 2024-04-03T11:36:30.5896501Z ##[debug]..Evaluating And: 2024-04-03T11:36:30.5900811Z ##[debug]....Evaluating Equal: 2024-04-03T11:36:30.5902240Z ##[debug]......Evaluating Index: 
2024-04-03T11:36:30.5903108Z ##[debug]........Evaluating github: 2024-04-03T11:36:30.5903693Z ##[debug]........=> Object 2024-04-03T11:36:30.5904224Z ##[debug]........Evaluating String: 2024-04-03T11:36:30.5905016Z ##[debug]........=> 'event_name' 2024-04-03T11:36:30.5905714Z ##[debug]......=> 'push' 2024-04-03T11:36:30.5906223Z ##[debug]......Evaluating String: 2024-04-03T11:36:30.5906797Z ##[debug]......=> 'pull_request' 2024-04-03T11:36:30.5912104Z ##[debug]....=> false 2024-04-03T11:36:30.5912839Z ##[debug]..=> false 2024-04-03T11:36:30.5913545Z ##[debug]..Evaluating Index: 2024-04-03T11:36:30.5914079Z ##[debug]....Evaluating github: 2024-04-03T11:36:30.5914636Z ##[debug]....=> Object 2024-04-03T11:36:30.5915034Z ##[debug]....Evaluating String: 2024-04-03T11:36:30.5915490Z ##[debug]....=> 'sha' 2024-04-03T11:36:30.5916009Z ##[debug]..=> 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:36:30.5916708Z ##[debug]=> 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:36:30.5923294Z ##[debug]Expanded: ((('push' == 'pull_request') && github['event']['pull_request']['head']['sha']) || 'd253bd7befc64137de913dbb8727b511f1052411') 2024-04-03T11:36:30.5924431Z ##[debug]Result: 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:36:30.5925348Z ##[debug]Evaluating: inputs.fetch-depth 2024-04-03T11:36:30.5925870Z ##[debug]Evaluating Index: 2024-04-03T11:36:30.5926290Z ##[debug]..Evaluating inputs: 2024-04-03T11:36:30.5926729Z ##[debug]..=> Object 2024-04-03T11:36:30.5927137Z ##[debug]..Evaluating String: 2024-04-03T11:36:30.5927587Z ##[debug]..=> 'fetch-depth' 2024-04-03T11:36:30.5928014Z ##[debug]=> '0' 2024-04-03T11:36:30.5928371Z ##[debug]Result: '0' 2024-04-03T11:36:30.5928941Z ##[debug]Evaluating: inputs.submodules 2024-04-03T11:36:30.5929452Z ##[debug]Evaluating Index: 2024-04-03T11:36:30.5929898Z ##[debug]..Evaluating inputs: 2024-04-03T11:36:30.5930352Z ##[debug]..=> Object 2024-04-03T11:36:30.5930745Z ##[debug]..Evaluating String: 
2024-04-03T11:36:30.5931189Z ##[debug]..=> 'submodules' 2024-04-03T11:36:30.5931633Z ##[debug]=> 'recursive' 2024-04-03T11:36:30.5932056Z ##[debug]Result: 'recursive' 2024-04-03T11:36:30.5932906Z ##[debug]Evaluating: github.repository 2024-04-03T11:36:30.5933418Z ##[debug]Evaluating Index: 2024-04-03T11:36:30.5933842Z ##[debug]..Evaluating github: 2024-04-03T11:36:30.5934291Z ##[debug]..=> Object 2024-04-03T11:36:30.5934687Z ##[debug]..Evaluating String: 2024-04-03T11:36:30.5935114Z ##[debug]..=> 'repository' 2024-04-03T11:36:30.5935556Z ##[debug]=> 'pytorch/pytorch' 2024-04-03T11:36:30.5936010Z ##[debug]Result: 'pytorch/pytorch' 2024-04-03T11:36:30.5937722Z ##[debug]Evaluating: github.token 2024-04-03T11:36:30.5938190Z ##[debug]Evaluating Index: 2024-04-03T11:36:30.5938616Z ##[debug]..Evaluating github: 2024-04-03T11:36:30.5939041Z ##[debug]..=> Object 2024-04-03T11:36:30.5939445Z ##[debug]..Evaluating String: 2024-04-03T11:36:30.5940063Z ##[debug]..=> 'token' 2024-04-03T11:36:30.5940896Z ##[debug]=> '***' 2024-04-03T11:36:30.5941452Z ##[debug]Result: '***' 2024-04-03T11:36:30.5952018Z ##[debug]Loading env 2024-04-03T11:36:30.5961608Z ##[group]Run malfet/checkout@silent-checkout 2024-04-03T11:36:30.5962106Z with: 2024-04-03T11:36:30.5962472Z ref: d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:36:30.5962975Z fetch-depth: 0 2024-04-03T11:36:30.5963341Z submodules: recursive 2024-04-03T11:36:30.5963731Z quiet-checkout: true 2024-04-03T11:36:30.5964135Z repository: pytorch/pytorch 2024-04-03T11:36:30.5964715Z token: *** 2024-04-03T11:36:30.5965056Z ssh-strict: true 2024-04-03T11:36:30.5965454Z persist-credentials: true 2024-04-03T11:36:30.5965866Z clean: true 2024-04-03T11:36:30.5966225Z sparse-checkout-cone-mode: true 2024-04-03T11:36:30.5966665Z lfs: false 2024-04-03T11:36:30.5967010Z set-safe-directory: true 2024-04-03T11:36:30.5967388Z env: 2024-04-03T11:36:30.5967711Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:36:30.5968100Z ##[endgroup] 
2024-04-03T11:36:30.7156612Z ##[debug]GITHUB_WORKSPACE = '/home/pytorchci/actions-runner/_work/pytorch/pytorch' 2024-04-03T11:36:30.7159816Z ##[debug]qualified repository = 'pytorch/pytorch' 2024-04-03T11:36:30.7161350Z ##[debug]ref = '' 2024-04-03T11:36:30.7162863Z ##[debug]commit = 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:36:30.7164669Z ##[debug]clean = true 2024-04-03T11:36:30.7165843Z ##[debug]fetch depth = 0 2024-04-03T11:36:30.7167020Z ##[debug]lfs = false 2024-04-03T11:36:30.7168179Z ##[debug]submodules = true 2024-04-03T11:36:30.7169461Z ##[debug]recursive submodules = true 2024-04-03T11:36:30.7170826Z ##[debug]GitHub Host URL = 2024-04-03T11:36:30.7173945Z ::add-matcher::/home/pytorchci/actions-runner/_work/_actions/malfet/checkout/silent-checkout/dist/problem-matcher.json 2024-04-03T11:36:30.7304687Z ##[debug]Added matchers: 'checkout-git'. Problem matchers scan action output for known warning or error strings and report these inline. 2024-04-03T11:36:30.7316313Z Syncing repository: pytorch/pytorch 2024-04-03T11:36:30.7317666Z ::group::Getting Git version info 2024-04-03T11:36:30.7319011Z ##[group]Getting Git version info 2024-04-03T11:36:30.7319957Z Working directory is '/home/pytorchci/actions-runner/_work/pytorch/pytorch' 2024-04-03T11:36:30.7320993Z ##[debug]Getting git version 2024-04-03T11:36:30.7321429Z [command]/usr/bin/git version 2024-04-03T11:36:30.7321866Z git version 2.39.0 2024-04-03T11:36:30.7322443Z ##[debug]0 2024-04-03T11:36:30.7323079Z ##[debug]git version 2.39.0 2024-04-03T11:36:30.7323457Z ##[debug] 2024-04-03T11:36:30.7324265Z ##[debug]Set git useragent to: git/2.39.0 (github-actions-checkout) 2024-04-03T11:36:30.7325182Z ::endgroup:: 2024-04-03T11:36:30.7325542Z ##[endgroup] 2024-04-03T11:36:30.7337313Z ::add-mask::*** 2024-04-03T11:36:30.7339122Z Temporarily overriding HOME='/home/pytorchci/actions-runner/_work/_temp/7d30ad05-d58c-4539-946a-4a7817857dbf' before making global git config changes 
2024-04-03T11:36:30.7340818Z Adding repository directory to the temporary git global config as a safe directory 2024-04-03T11:36:30.7342424Z [command]/usr/bin/git config --global --add safe.directory /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:36:30.7344435Z ##[debug]0 2024-04-03T11:36:30.7345113Z ##[debug] 2024-04-03T11:36:30.7356616Z Deleting the contents of '/home/pytorchci/actions-runner/_work/pytorch/pytorch' 2024-04-03T11:36:30.7361617Z ::group::Initializing the repository 2024-04-03T11:36:30.7362244Z ##[group]Initializing the repository 2024-04-03T11:36:30.7365705Z [command]/usr/bin/git init /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:36:30.7414272Z hint: Using 'master' as the name for the initial branch. This default branch name 2024-04-03T11:36:30.7416465Z hint: is subject to change. To configure the initial branch name to use in all 2024-04-03T11:36:30.7418414Z hint: of your new repositories, which will suppress this warning, call: 2024-04-03T11:36:30.7419946Z hint: 2024-04-03T11:36:30.7420924Z hint: git config --global init.defaultBranch 2024-04-03T11:36:30.7421944Z hint: 2024-04-03T11:36:30.7423268Z hint: Names commonly chosen instead of 'master' are 'main', 'trunk' and 2024-04-03T11:36:30.7425165Z hint: 'development'. 
The just-created branch can be renamed via this command: 2024-04-03T11:36:30.7426476Z hint: 2024-04-03T11:36:30.7427167Z hint: git branch -m 2024-04-03T11:36:30.7428827Z Initialized empty Git repository in /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/ 2024-04-03T11:36:30.7431031Z ##[debug]0 2024-04-03T11:36:30.7433011Z ##[debug]Initialized empty Git repository in /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/ 2024-04-03T11:36:30.7434631Z ##[debug] 2024-04-03T11:36:30.7435714Z [command]/usr/bin/git remote add origin https://github.com/pytorch/pytorch 2024-04-03T11:36:30.7488892Z ##[debug]0 2024-04-03T11:36:30.7490103Z ##[debug] 2024-04-03T11:36:30.7491487Z ::endgroup:: 2024-04-03T11:36:30.7492168Z ##[endgroup] 2024-04-03T11:36:30.7493465Z ::group::Disabling automatic garbage collection 2024-04-03T11:36:30.7494677Z ##[group]Disabling automatic garbage collection 2024-04-03T11:36:30.7496834Z [command]/usr/bin/git config --local gc.auto 0 2024-04-03T11:36:30.7549663Z ##[debug]0 2024-04-03T11:36:30.7550763Z ##[debug] 2024-04-03T11:36:30.7551821Z ::endgroup:: 2024-04-03T11:36:30.7552853Z ##[endgroup] 2024-04-03T11:36:30.7553961Z ::group::Setting up auth 2024-04-03T11:36:30.7554725Z ##[group]Setting up auth 2024-04-03T11:36:30.7564280Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2024-04-03T11:36:30.7623146Z ##[debug]1 2024-04-03T11:36:30.7624704Z ##[debug] 2024-04-03T11:36:30.7632251Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2024-04-03T11:36:30.8030348Z ##[debug]0 2024-04-03T11:36:30.8031865Z ##[debug] 2024-04-03T11:36:30.8040703Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2024-04-03T11:36:30.8103135Z ##[debug]1 2024-04-03T11:36:30.8104650Z ##[debug] 2024-04-03T11:36:30.8111447Z [command]/usr/bin/git submodule 
foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2024-04-03T11:36:30.8513732Z ##[debug]0 2024-04-03T11:36:30.8514792Z ##[debug] 2024-04-03T11:36:30.8534580Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2024-04-03T11:36:30.8603002Z ##[debug]0 2024-04-03T11:36:30.8604531Z ##[debug] 2024-04-03T11:36:30.8627955Z ::endgroup:: 2024-04-03T11:36:30.8628870Z ##[endgroup] 2024-04-03T11:36:30.8630504Z ::group::Fetching the repository 2024-04-03T11:36:30.8631758Z ##[group]Fetching the repository 2024-04-03T11:36:30.8644624Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --progress --no-recurse-submodules --quiet origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2024-04-03T11:36:36.8525919Z remote: Enumerating objects: 1123818 2024-04-03T11:36:36.8528492Z remote: Enumerating objects: 1124105, done. 
2024-04-03T11:36:36.8669631Z remote: Counting objects: 100% (287/287), done. 
2024-04-03T11:36:36.8670249Z remote: Compressing objects: 0% (1/176) 2024-04-03T11:36:36.8670855Z remote: Compressing objects: 1% (2/176) 2024-04-03T11:36:36.8717141Z remote: Compressing objects: 2% (4/176) 2024-04-03T11:36:36.8845649Z remote: Compressing objects: 3% (6/176) 2024-04-03T11:36:36.8900704Z remote: Compressing objects: 4% (8/176) 2024-04-03T11:36:36.8992139Z remote: Compressing objects: 5% (9/176) 2024-04-03T11:36:36.9134984Z remote: Compressing objects: 6% (11/176) 2024-04-03T11:36:36.9245413Z remote: Compressing objects: 7% (13/176) 2024-04-03T11:36:36.9277844Z remote: Compressing objects: 8% (15/176) 2024-04-03T11:36:36.9324931Z remote: Compressing objects: 9% (16/176) 2024-04-03T11:36:36.9395218Z remote: Compressing objects: 10% (18/176) 2024-04-03T11:36:36.9463333Z remote: Compressing objects: 11% (20/176) 2024-04-03T11:36:36.9475085Z remote: Compressing objects: 12% (22/176) 2024-04-03T11:36:36.9491044Z remote: Compressing objects: 13% (23/176) 2024-04-03T11:36:36.9527548Z remote: Compressing objects: 14% (25/176) 2024-04-03T11:36:36.9558773Z remote: Compressing objects: 15% (27/176) 2024-04-03T11:36:36.9562254Z remote: Compressing objects: 16% (29/176) 2024-04-03T11:36:36.9593233Z remote: Compressing objects: 17% (30/176) 2024-04-03T11:36:36.9611452Z remote: Compressing objects: 18% (32/176) 2024-04-03T11:36:36.9613281Z remote: Compressing objects: 19% (34/176) 2024-04-03T11:36:36.9614895Z remote: Compressing objects: 20% (36/176) 2024-04-03T11:36:36.9619238Z remote: Compressing objects: 21% (37/176) 2024-04-03T11:36:36.9621060Z remote: Compressing objects: 22% (39/176) 2024-04-03T11:36:36.9622677Z remote: Compressing objects: 23% (41/176) 2024-04-03T11:36:36.9624279Z remote: Compressing objects: 24% (43/176) 2024-04-03T11:36:36.9625874Z remote: Compressing objects: 25% (44/176) 2024-04-03T11:36:36.9627484Z remote: Compressing objects: 26% (46/176) 2024-04-03T11:36:36.9629079Z remote: Compressing objects: 27% (48/176) 
2024-04-03T11:36:36.9631305Z remote: Compressing objects: 28% (50/176) 2024-04-03T11:36:36.9632953Z remote: Compressing objects: 29% (52/176) 2024-04-03T11:36:36.9639599Z remote: Compressing objects: 30% (53/176) 2024-04-03T11:36:36.9641206Z remote: Compressing objects: 31% (55/176) 2024-04-03T11:36:36.9642820Z remote: Compressing objects: 32% (57/176) 2024-04-03T11:36:36.9645191Z remote: Compressing objects: 33% (59/176) 2024-04-03T11:36:36.9648779Z remote: Compressing objects: 34% (60/176) 2024-04-03T11:36:36.9652506Z remote: Compressing objects: 35% (62/176) 2024-04-03T11:36:36.9654516Z remote: Compressing objects: 36% (64/176) 2024-04-03T11:36:36.9656518Z remote: Compressing objects: 37% (66/176) 2024-04-03T11:36:36.9657719Z remote: Compressing objects: 38% (67/176) 2024-04-03T11:36:36.9659454Z remote: Compressing objects: 39% (69/176) 2024-04-03T11:36:36.9661787Z remote: Compressing objects: 40% (71/176) 2024-04-03T11:36:36.9663505Z remote: Compressing objects: 41% (73/176) 2024-04-03T11:36:36.9665183Z remote: Compressing objects: 42% (74/176) 2024-04-03T11:36:36.9666911Z remote: Compressing objects: 43% (76/176) 2024-04-03T11:36:36.9677621Z remote: Compressing objects: 44% (78/176) 2024-04-03T11:36:36.9685130Z remote: Compressing objects: 45% (80/176) 2024-04-03T11:36:36.9686638Z remote: Compressing objects: 46% (81/176) 2024-04-03T11:36:36.9688188Z remote: Compressing objects: 47% (83/176) 2024-04-03T11:36:36.9689699Z remote: Compressing objects: 48% (85/176) 2024-04-03T11:36:36.9691159Z remote: Compressing objects: 49% (87/176) 2024-04-03T11:36:36.9692610Z remote: Compressing objects: 50% (88/176) 2024-04-03T11:36:36.9693784Z remote: Compressing objects: 51% (90/176) 2024-04-03T11:36:36.9694930Z remote: Compressing objects: 52% (92/176) 2024-04-03T11:36:36.9696072Z remote: Compressing objects: 53% (94/176) 2024-04-03T11:36:36.9697235Z remote: Compressing objects: 54% (96/176) 2024-04-03T11:36:36.9698385Z remote: Compressing objects: 55% (97/176) 
2024-04-03T11:36:36.9699538Z remote: Compressing objects: 56% (99/176) 2024-04-03T11:36:36.9701092Z remote: Compressing objects: 57% (101/176) 2024-04-03T11:36:36.9702549Z remote: Compressing objects: 58% (103/176) 2024-04-03T11:36:36.9703717Z remote: Compressing objects: 59% (104/176) 2024-04-03T11:36:36.9704858Z remote: Compressing objects: 60% (106/176) 2024-04-03T11:36:36.9706011Z remote: Compressing objects: 61% (108/176) 2024-04-03T11:36:36.9711357Z remote: Compressing objects: 62% (110/176) 2024-04-03T11:36:36.9713266Z remote: Compressing objects: 63% (111/176) 2024-04-03T11:36:36.9714725Z remote: Compressing objects: 64% (113/176) 2024-04-03T11:36:36.9715986Z remote: Compressing objects: 65% (115/176) 2024-04-03T11:36:36.9717455Z remote: Compressing objects: 66% (117/176) 2024-04-03T11:36:36.9718593Z remote: Compressing objects: 67% (118/176) 2024-04-03T11:36:36.9719734Z remote: Compressing objects: 68% (120/176) 2024-04-03T11:36:36.9720888Z remote: Compressing objects: 69% (122/176) 2024-04-03T11:36:36.9722007Z remote: Compressing objects: 70% (124/176) 2024-04-03T11:36:36.9723138Z remote: Compressing objects: 71% (125/176) 2024-04-03T11:36:36.9724282Z remote: Compressing objects: 72% (127/176) 2024-04-03T11:36:36.9725406Z remote: Compressing objects: 73% (129/176) 2024-04-03T11:36:36.9726555Z remote: Compressing objects: 74% (131/176) 2024-04-03T11:36:36.9728126Z remote: Compressing objects: 75% (132/176) 2024-04-03T11:36:36.9729579Z remote: Compressing objects: 76% (134/176) 2024-04-03T11:36:36.9730812Z remote: Compressing objects: 77% (136/176) 2024-04-03T11:36:36.9731949Z remote: Compressing objects: 78% (138/176) 2024-04-03T11:36:36.9733502Z remote: Compressing objects: 79% (140/176) 2024-04-03T11:36:36.9734662Z remote: Compressing objects: 80% (141/176) 2024-04-03T11:36:36.9736028Z remote: Compressing objects: 81% (143/176) 2024-04-03T11:36:36.9737157Z remote: Compressing objects: 82% (145/176) 2024-04-03T11:36:36.9738277Z remote: Compressing 
objects: 83% (147/176) 2024-04-03T11:36:36.9739408Z remote: Compressing objects: 84% (148/176) 2024-04-03T11:36:36.9740672Z remote: Compressing objects: 85% (150/176) 2024-04-03T11:36:36.9741793Z remote: Compressing objects: 86% (152/176) 2024-04-03T11:36:36.9742950Z remote: Compressing objects: 87% (154/176) 2024-04-03T11:36:36.9744084Z remote: Compressing objects: 88% (155/176) 2024-04-03T11:36:36.9745201Z remote: Compressing objects: 89% (157/176) 2024-04-03T11:36:36.9746338Z remote: Compressing objects: 90% (159/176) 2024-04-03T11:36:36.9747465Z remote: Compressing objects: 91% (161/176) 2024-04-03T11:36:36.9748611Z remote: Compressing objects: 92% (162/176) 2024-04-03T11:36:36.9749731Z remote: Compressing objects: 93% (164/176) 2024-04-03T11:36:36.9750877Z remote: Compressing objects: 94% (166/176) 2024-04-03T11:36:36.9752014Z remote: Compressing objects: 95% (168/176) 2024-04-03T11:36:36.9753135Z remote: Compressing objects: 96% (169/176) 2024-04-03T11:36:36.9754266Z remote: Compressing objects: 97% (171/176) 2024-04-03T11:36:36.9755410Z remote: Compressing objects: 98% (173/176) 2024-04-03T11:36:36.9756532Z remote: Compressing objects: 99% (175/176) 2024-04-03T11:36:36.9757669Z remote: Compressing objects: 100% (176/176) 2024-04-03T11:36:36.9758890Z remote: Compressing objects: 100% (176/176), done. 
2024-04-03T11:37:03.3989147Z remote: Total 1124105 (delta 147), reused 234 (delta 111), pack-reused 1123818 2024-04-03T11:37:32.9798235Z ##[debug]0 2024-04-03T11:37:32.9799734Z ##[debug] 2024-04-03T11:37:32.9815202Z [command]/usr/bin/git rev-parse --verify --quiet d253bd7befc64137de913dbb8727b511f1052411^{object} 2024-04-03T11:37:32.9880700Z d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:37:32.9886024Z ##[debug]0 2024-04-03T11:37:32.9888106Z ##[debug]d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:37:32.9889739Z ##[debug] 2024-04-03T11:37:32.9891383Z ::endgroup:: 2024-04-03T11:37:32.9892274Z ##[endgroup] 2024-04-03T11:37:32.9893975Z ::group::Determining the checkout info 2024-04-03T11:37:32.9895407Z ##[group]Determining the checkout info 2024-04-03T11:37:32.9897618Z ::endgroup:: 2024-04-03T11:37:32.9898511Z ##[endgroup] 2024-04-03T11:37:32.9900831Z ::group::Checking out the ref 2024-04-03T11:37:32.9902032Z ##[group]Checking out the ref 2024-04-03T11:37:32.9904606Z [command]/usr/bin/git checkout --quiet --force d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:37:34.5426838Z ##[debug]0 2024-04-03T11:37:34.5428356Z ##[debug] 2024-04-03T11:37:34.5432995Z ::endgroup:: 2024-04-03T11:37:34.5433972Z ##[endgroup] 2024-04-03T11:37:34.5435771Z ::group::Setting up auth for fetching submodules 2024-04-03T11:37:34.5437407Z ##[group]Setting up auth for fetching submodules 2024-04-03T11:37:34.5440525Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2024-04-03T11:37:34.5501221Z ##[debug]0 2024-04-03T11:37:34.5502743Z ##[debug] 2024-04-03T11:37:34.5521433Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2024-04-03T11:37:34.5568115Z ##[debug]5 2024-04-03T11:37:34.5569681Z ##[debug] 2024-04-03T11:37:34.5572253Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2024-04-03T11:37:34.5620701Z ##[debug]0 2024-04-03T11:37:34.5622285Z 
##[debug] 2024-04-03T11:37:34.5625166Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2024-04-03T11:37:34.5674826Z ##[debug]0 2024-04-03T11:37:34.5676400Z ##[debug] 2024-04-03T11:37:34.5678702Z ::endgroup:: 2024-04-03T11:37:34.5679644Z ##[endgroup] 2024-04-03T11:37:34.5681294Z ::group::Fetching submodules 2024-04-03T11:37:34.5682489Z ##[group]Fetching submodules 2024-04-03T11:37:34.5684561Z [command]/usr/bin/git submodule sync --recursive 2024-04-03T11:37:34.6151633Z ##[debug]0 2024-04-03T11:37:34.6153144Z ##[debug] 2024-04-03T11:37:34.6162395Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 2024-04-03T11:37:34.6663405Z Submodule 'android/libs/fbjni' (https://github.com/facebookincubator/fbjni.git) registered for path 'android/libs/fbjni' 2024-04-03T11:37:34.6668825Z Submodule 'third_party/NNPACK_deps/FP16' (https://github.com/Maratyszcza/FP16.git) registered for path 'third_party/FP16' 2024-04-03T11:37:34.6676681Z Submodule 'third_party/NNPACK_deps/FXdiv' (https://github.com/Maratyszcza/FXdiv.git) registered for path 'third_party/FXdiv' 2024-04-03T11:37:34.6682429Z Submodule 'third_party/NNPACK' (https://github.com/Maratyszcza/NNPACK.git) registered for path 'third_party/NNPACK' 2024-04-03T11:37:34.6687611Z Submodule 'third_party/QNNPACK' (https://github.com/pytorch/QNNPACK) registered for path 'third_party/QNNPACK' 2024-04-03T11:37:34.6694582Z Submodule 'third_party/VulkanMemoryAllocator' (https://github.com/GPUOpen-LibrariesAndSDKs/VulkanMemoryAllocator.git) registered for path 'third_party/VulkanMemoryAllocator' 2024-04-03T11:37:34.6701760Z Submodule 'third_party/XNNPACK' (https://github.com/google/XNNPACK.git) registered for path 'third_party/XNNPACK' 2024-04-03T11:37:34.6706874Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/benchmark' 2024-04-03T11:37:34.6712047Z Submodule 'third_party/cpuinfo' 
(https://github.com/pytorch/cpuinfo.git) registered for path 'third_party/cpuinfo' 2024-04-03T11:37:34.6715750Z Submodule 'third_party/cub' (https://github.com/NVlabs/cub.git) registered for path 'third_party/cub' 2024-04-03T11:37:34.6719541Z Submodule 'third_party/cudnn_frontend' (https://github.com/NVIDIA/cudnn-frontend.git) registered for path 'third_party/cudnn_frontend' 2024-04-03T11:37:34.6723491Z Submodule 'third_party/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/cutlass' 2024-04-03T11:37:34.6726887Z Submodule 'third_party/eigen' (https://gitlab.com/libeigen/eigen.git) registered for path 'third_party/eigen' 2024-04-03T11:37:34.6730248Z Submodule 'third_party/fbgemm' (https://github.com/pytorch/fbgemm) registered for path 'third_party/fbgemm' 2024-04-03T11:37:34.6734047Z Submodule 'third_party/flatbuffers' (https://github.com/google/flatbuffers.git) registered for path 'third_party/flatbuffers' 2024-04-03T11:37:34.6737664Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/fmt' 2024-04-03T11:37:34.6741159Z Submodule 'third_party/foxi' (https://github.com/houseroad/foxi.git) registered for path 'third_party/foxi' 2024-04-03T11:37:34.6745021Z Submodule 'third_party/gemmlowp/gemmlowp' (https://github.com/google/gemmlowp.git) registered for path 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:37:34.6749047Z Submodule 'third_party/gloo' (https://github.com/facebookincubator/gloo) registered for path 'third_party/gloo' 2024-04-03T11:37:34.6752764Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/googletest' 2024-04-03T11:37:34.6756013Z Submodule 'third_party/ideep' (https://github.com/intel/ideep) registered for path 'third_party/ideep' 2024-04-03T11:37:34.6759666Z Submodule 'third_party/ios-cmake' (https://github.com/Yangqing/ios-cmake.git) registered for path 'third_party/ios-cmake' 2024-04-03T11:37:34.6763451Z Submodule 
'third_party/ittapi' (https://github.com/intel/ittapi.git) registered for path 'third_party/ittapi' 2024-04-03T11:37:34.6768896Z Submodule 'third_party/kineto' (https://github.com/pytorch/kineto) registered for path 'third_party/kineto' 2024-04-03T11:37:34.6774927Z Submodule 'third_party/mimalloc' (https://github.com/microsoft/mimalloc.git) registered for path 'third_party/mimalloc' 2024-04-03T11:37:34.6779974Z Submodule 'third_party/nccl/nccl' (https://github.com/NVIDIA/nccl) registered for path 'third_party/nccl/nccl' 2024-04-03T11:37:34.6785930Z Submodule 'third_party/neon2sse' (https://github.com/intel/ARM_NEON_2_x86_SSE.git) registered for path 'third_party/neon2sse' 2024-04-03T11:37:34.6791502Z Submodule 'third_party/nlohmann' (https://github.com/nlohmann/json.git) registered for path 'third_party/nlohmann' 2024-04-03T11:37:34.6796864Z Submodule 'third_party/onnx' (https://github.com/onnx/onnx.git) registered for path 'third_party/onnx' 2024-04-03T11:37:34.6802950Z Submodule 'third_party/onnx-tensorrt' (https://github.com/onnx/onnx-tensorrt) registered for path 'third_party/onnx-tensorrt' 2024-04-03T11:37:34.6808667Z Submodule 'third_party/pocketfft' (https://github.com/mreineck/pocketfft) registered for path 'third_party/pocketfft' 2024-04-03T11:37:34.6814825Z Submodule 'third_party/protobuf' (https://github.com/protocolbuffers/protobuf.git) registered for path 'third_party/protobuf' 2024-04-03T11:37:34.6820871Z Submodule 'third_party/NNPACK_deps/psimd' (https://github.com/Maratyszcza/psimd.git) registered for path 'third_party/psimd' 2024-04-03T11:37:34.6827374Z Submodule 'third_party/NNPACK_deps/pthreadpool' (https://github.com/Maratyszcza/pthreadpool.git) registered for path 'third_party/pthreadpool' 2024-04-03T11:37:34.6832937Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/pybind11' 2024-04-03T11:37:34.6838629Z Submodule 'third_party/python-peachpy' (https://github.com/malfet/PeachPy.git) 
registered for path 'third_party/python-peachpy' 2024-04-03T11:37:34.6844783Z Submodule 'third_party/sleef' (https://github.com/shibatch/sleef) registered for path 'third_party/sleef' 2024-04-03T11:37:34.6851166Z Submodule 'third_party/tbb' (https://github.com/01org/tbb) registered for path 'third_party/tbb' 2024-04-03T11:37:34.6858290Z Submodule 'third_party/tensorpipe' (https://github.com/pytorch/tensorpipe.git) registered for path 'third_party/tensorpipe' 2024-04-03T11:37:34.6865362Z Submodule 'third_party/zstd' (https://github.com/facebook/zstd.git) registered for path 'third_party/zstd' 2024-04-03T11:37:34.6902434Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/android/libs/fbjni'... 2024-04-03T11:37:35.6930132Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/FP16'... 2024-04-03T11:37:36.4838091Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/FXdiv'... 2024-04-03T11:37:37.1574297Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/NNPACK'... 2024-04-03T11:37:38.2508040Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/QNNPACK'... 2024-04-03T11:37:39.2682770Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/VulkanMemoryAllocator'... 2024-04-03T11:37:42.4032794Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/XNNPACK'... 2024-04-03T11:37:55.0956073Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/benchmark'... 2024-04-03T11:37:56.3767833Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/cpuinfo'... 2024-04-03T11:37:58.0599126Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/cub'... 2024-04-03T11:38:00.5760441Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/cudnn_frontend'... 
2024-04-03T11:38:03.0705156Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/cutlass'... 2024-04-03T11:38:07.1847404Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/eigen'... 2024-04-03T11:38:17.1636627Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm'... 2024-04-03T11:38:19.4536758Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/flatbuffers'... 2024-04-03T11:38:22.3913202Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fmt'... 2024-04-03T11:38:24.6919154Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/foxi'... 2024-04-03T11:38:25.4220725Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/gemmlowp/gemmlowp'... 2024-04-03T11:38:26.7284647Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/gloo'... 2024-04-03T11:38:27.9544462Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/googletest'... 2024-04-03T11:38:30.1429768Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/ideep'... 2024-04-03T11:38:31.3402209Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/ios-cmake'... 2024-04-03T11:38:32.0217509Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/ittapi'... 2024-04-03T11:38:32.9829999Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto'... 2024-04-03T11:38:35.8466598Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/mimalloc'... 2024-04-03T11:38:37.5949612Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/nccl/nccl'... 2024-04-03T11:38:39.1650136Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/neon2sse'... 
2024-04-03T11:38:40.1648967Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/nlohmann'... 2024-04-03T11:38:48.3046707Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx'... 2024-04-03T11:38:51.8310577Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx-tensorrt'... 2024-04-03T11:38:53.1904172Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/pocketfft'... 2024-04-03T11:38:54.0854773Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/protobuf'... 2024-04-03T11:39:04.9452887Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/psimd'... 2024-04-03T11:39:05.6263269Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/pthreadpool'... 2024-04-03T11:39:06.4658699Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/pybind11'... 2024-04-03T11:39:08.4656120Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/python-peachpy'... 2024-04-03T11:39:09.5586250Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/sleef'... 2024-04-03T11:39:11.0821733Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tbb'... 2024-04-03T11:39:14.5607650Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe'... 2024-04-03T11:39:15.8900620Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/zstd'... 
2024-04-03T11:39:19.5811409Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2024-04-03T11:39:19.5942395Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2024-04-03T11:39:19.6058894Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2024-04-03T11:39:19.6405804Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2024-04-03T11:39:19.6767837Z Submodule path 'third_party/QNNPACK': checked out '7d2a4e9931a82adc3814275b6219a03e24e36b4c' 2024-04-03T11:39:19.7191927Z Submodule path 'third_party/VulkanMemoryAllocator': checked out 'a6bfc237255a6bac1513f7c1ebde6d8aed6b5191' 2024-04-03T11:39:20.6775356Z Submodule path 'third_party/XNNPACK': checked out 'fcbf55af6cf28a4627bcd1f703ab7ad843f0f3a2' 2024-04-03T11:39:20.7021609Z Submodule path 'third_party/benchmark': checked out '0d98dba29d66e93259db7daa53a9327df767a415' 2024-04-03T11:39:20.8114485Z Submodule path 'third_party/cpuinfo': checked out 'd6860c477c99f1fce9e28eb206891af3c0e1a1d7' 2024-04-03T11:39:20.8487630Z Submodule path 'third_party/cub': checked out 'd106ddb991a56c3df1b6d51b2409e36ba8181ce4' 2024-04-03T11:39:20.8850494Z Submodule path 'third_party/cudnn_frontend': checked out '150798fe976556078f443fdb059a1ff0361f58a2' 2024-04-03T11:39:21.4005243Z Submodule path 'third_party/cutlass': checked out 'bbe579a9e3beb6ea6626d9227ec32d0dae119a49' 2024-04-03T11:39:21.6572328Z Submodule path 'third_party/eigen': checked out '3147391d946bb4b6c68edd901f2add6ac1f31f8c' 2024-04-03T11:39:21.7271270Z Submodule path 'third_party/fbgemm': checked out 'dbc3157bf256f1339b3fa1fef2be89ac4078be0e' 2024-04-03T11:39:21.7295809Z Submodule 'third_party/asmjit' (https://github.com/asmjit/asmjit.git) registered for path 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:39:21.7300807Z Submodule 'third_party/cpuinfo' (https://github.com/pytorch/cpuinfo) registered for path 
'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:39:21.7305474Z Submodule 'third_party/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:39:21.7310208Z Submodule 'third_party/googletest' (https://github.com/google/googletest) registered for path 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:39:21.7315428Z Submodule 'third_party/hipify_torch' (https://github.com/ROCmSoftwarePlatform/hipify_torch.git) registered for path 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:39:21.7362755Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/asmjit'... 2024-04-03T11:39:23.7704397Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/cpuinfo'... 2024-04-03T11:39:25.3076012Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/cutlass'... 2024-04-03T11:39:29.5301386Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/googletest'... 2024-04-03T11:39:31.7511988Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/hipify_torch'... 
2024-04-03T11:39:32.7373528Z Submodule path 'third_party/fbgemm/third_party/asmjit': checked out 'd3fbf7c9bc7c1d1365a94a45614b91c5a3706b81' 2024-04-03T11:39:32.8394080Z Submodule path 'third_party/fbgemm/third_party/cpuinfo': checked out 'ed8b86a253800bafdb7b25c5c399f91bff9cb1f3' 2024-04-03T11:39:33.2643505Z Submodule path 'third_party/fbgemm/third_party/cutlass': checked out 'fc9ebc645b63f3a6bc80aaefde5c063fb72110d6' 2024-04-03T11:39:33.3400726Z Submodule path 'third_party/fbgemm/third_party/googletest': checked out 'cbf019de22c8dd37b2108da35b2748fd702d1796' 2024-04-03T11:39:33.3591149Z Submodule path 'third_party/fbgemm/third_party/hipify_torch': checked out '23f53b025b466d8ec3c45d52290d3442f7fbe6b1' 2024-04-03T11:39:33.4792955Z Submodule path 'third_party/flatbuffers': checked out '01834de25e4bf3975a9a00e816292b1ad0fe184b' 2024-04-03T11:39:33.5179894Z Submodule path 'third_party/fmt': checked out 'e69e5f977d458f2650bb346dadf2ad30c5320281' 2024-04-03T11:39:33.5378282Z Submodule path 'third_party/foxi': checked out 'c278588e34e535f0bb8f00df3880d26928038cad' 2024-04-03T11:39:33.5848389Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2024-04-03T11:39:33.6198624Z Submodule path 'third_party/gloo': checked out '5354032ea08eadd7fc4456477f7f7c6308818509' 2024-04-03T11:39:33.6751198Z Submodule path 'third_party/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2024-04-03T11:39:33.6965907Z Submodule path 'third_party/ideep': checked out '8a6cc4e09dc509f04f83c085e38786b1fb44e14d' 2024-04-03T11:39:33.7000890Z Submodule 'mkl-dnn' (https://github.com/intel/mkl-dnn.git) registered for path 'third_party/ideep/mkl-dnn' 2024-04-03T11:39:33.7056924Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/ideep/mkl-dnn'... 
2024-04-03T11:39:48.2672036Z Submodule path 'third_party/ideep/mkl-dnn': checked out '86e6af5974177e513fd3fee58425e1063e7f1361' 2024-04-03T11:39:48.2881437Z Submodule path 'third_party/ios-cmake': checked out '8abaed637d56f1337d6e1d2c4026e25c1eade724' 2024-04-03T11:39:48.3125341Z Submodule path 'third_party/ittapi': checked out '5b8a7d7422611c3a0d799fb5fc5dd4abfae35b42' 2024-04-03T11:39:48.4227982Z Submodule path 'third_party/kineto': checked out '8466a8b111b36dc725e6855d52a0b133d925a8e0' 2024-04-03T11:39:48.4268020Z Submodule 'libkineto/third_party/dynolog' (https://github.com/facebookincubator/dynolog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:39:48.4275483Z Submodule 'libkineto/third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:39:48.4284270Z Submodule 'libkineto/third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:39:48.4332171Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog'... 2024-04-03T11:39:50.0911504Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/fmt'... 2024-04-03T11:39:52.3731770Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/googletest'... 
2024-04-03T11:39:54.6426167Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out '7d04a0053a845370ae06ce317a22a48e9edcc74e' 2024-04-03T11:39:54.6445560Z Submodule 'third_party/DCGM' (https://github.com/NVIDIA/DCGM.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:39:54.6452910Z Submodule 'third_party/cpr' (https://github.com/libcpr/cpr.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:39:54.6460341Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:39:54.6467000Z Submodule 'third_party/gflags' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:39:54.6473020Z Submodule 'third_party/glog' (https://github.com/google/glog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:39:54.6478805Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:39:54.6484546Z Submodule 'third_party/json' (https://github.com/nlohmann/json.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:39:54.6489970Z Submodule 'third_party/pfs' (https://github.com/dtrugman/pfs.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:39:54.6522122Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM'... 2024-04-03T11:39:56.7009209Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/cpr'... 
2024-04-03T11:39:57.8621892Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/fmt'... 2024-04-03T11:40:00.3157539Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags'... 2024-04-03T11:40:01.5080377Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/glog'... 2024-04-03T11:40:02.7166584Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/googletest'... 2024-04-03T11:40:04.9736932Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/json'... 2024-04-03T11:40:12.6813064Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/pfs'... 
2024-04-03T11:40:13.7807382Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2024-04-03T11:40:13.8091220Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2024-04-03T11:40:13.8550110Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2024-04-03T11:40:13.8797570Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2024-04-03T11:40:13.8835357Z Submodule 'doc' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:13.8892158Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc'... 
2024-04-03T11:40:15.0162467Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2024-04-03T11:40:15.0516585Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2024-04-03T11:40:15.1031889Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '58d77fa8070e8cec2dc1ed015d66b454c8d78850' 2024-04-03T11:40:15.2175880Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2024-04-03T11:40:15.2429865Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2024-04-03T11:40:15.2908041Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out 'a33701196adfad74917046096bf5a2aa0ab0bb50' 2024-04-03T11:40:15.3613150Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '7aca84427f224eeed3144123d5230d5871e93347' 2024-04-03T11:40:15.4064132Z Submodule path 'third_party/mimalloc': checked out 'b66e3214d8a104669c2ec05ae91ebc26a8f5ab78' 2024-04-03T11:40:15.4454816Z Submodule path 'third_party/nccl/nccl': checked out '48bb7fec7953112ff37499a272317f6663f8f600' 2024-04-03T11:40:15.4657383Z Submodule path 'third_party/neon2sse': checked out '97a126f08ce318023be604d03f88bf0820a9464a' 2024-04-03T11:40:15.5750639Z Submodule path 'third_party/nlohmann': checked out '87cda1d6646592ac5866dc703c8e1839046a6806' 2024-04-03T11:40:15.9342589Z Submodule path 'third_party/onnx': checked out 'ccde5da81388ffa770ca98b64e07f803ad089414' 2024-04-03T11:40:15.9408639Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:15.9414620Z Submodule 'third_party/pybind11' 
(https://github.com/pybind/pybind11.git) registered for path 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:15.9469813Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx/third_party/benchmark'... 2024-04-03T11:40:17.2222829Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx/third_party/pybind11'... 2024-04-03T11:40:19.2394778Z Submodule path 'third_party/onnx/third_party/benchmark': checked out '2dd015dfef425c866d9a43f2c67d8b52d709acb6' 2024-04-03T11:40:19.2911201Z Submodule path 'third_party/onnx/third_party/pybind11': checked out '5b0a6fc2017fcc176545afe3e09c9f9885283242' 2024-04-03T11:40:19.3224922Z Submodule path 'third_party/onnx-tensorrt': checked out 'c153211418a7c57ce071d9ce2a41f8d1c85a878f' 2024-04-03T11:40:19.3246625Z Submodule 'third_party/onnx' (https://github.com/onnx/onnx.git) registered for path 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:19.3303598Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx-tensorrt/third_party/onnx'... 2024-04-03T11:40:22.9419183Z Submodule path 'third_party/onnx-tensorrt/third_party/onnx': checked out '765f5ee823a67a866f4bd28a9860e81f3c811ce8' 2024-04-03T11:40:22.9442418Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:22.9448328Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:22.9488498Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark'... 2024-04-03T11:40:24.2550855Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11'... 
2024-04-03T11:40:26.2945501Z Submodule path 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark': checked out 'e776aa0275e293707b6a0901e0e8d8a8a3679508' 2024-04-03T11:40:26.3775593Z Submodule path 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11': checked out 'a1041190c8b8ff0cd9e2f0752248ad5e3789ea0c' 2024-04-03T11:40:26.3795324Z Submodule 'tools/clang' (https://github.com/wjakob/clang-cindex-python3) registered for path 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:26.3838724Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang'... 2024-04-03T11:40:27.2031176Z Submodule path 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2024-04-03T11:40:27.2258793Z Submodule path 'third_party/pocketfft': checked out '9d3ab05a7fffbc71a492bc6a17be034e83e8f0fe' 2024-04-03T11:40:27.5170606Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2024-04-03T11:40:27.5200710Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:27.5205800Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:27.5253993Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/benchmark'... 2024-04-03T11:40:28.7722200Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/googletest'... 
2024-04-03T11:40:30.9490183Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2024-04-03T11:40:31.0300008Z Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2024-04-03T11:40:31.0466127Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2024-04-03T11:40:31.0639350Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2024-04-03T11:40:31.0992963Z Submodule path 'third_party/pybind11': checked out '3e9dfa2866941655c56877882565e7577de6fc7b' 2024-04-03T11:40:31.1438259Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2024-04-03T11:40:31.1961525Z Submodule path 'third_party/sleef': checked out '60e76d2bce17d278b439d9da17177c8f957a9e9b' 2024-04-03T11:40:31.3324438Z Submodule path 'third_party/tbb': checked out 'a51a90bc609bb73db8ea13841b5cf7aa4344d4a9' 2024-04-03T11:40:31.3663304Z Submodule path 'third_party/tensorpipe': checked out '52791a2fd214b2a9dc5759d36725909c1daa7f2e' 2024-04-03T11:40:31.3687147Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:31.3692672Z Submodule 'third_party/libnop' (https://github.com/google/libnop.git) registered for path 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:31.3698024Z Submodule 'third_party/libuv' (https://github.com/libuv/libuv.git) registered for path 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:31.3703286Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:31.3748443Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/googletest'... 
2024-04-03T11:40:33.6666430Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libnop'... 2024-04-03T11:40:34.6271407Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libuv'... 2024-04-03T11:40:36.9209410Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11'... 2024-04-03T11:40:39.0257938Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2024-04-03T11:40:39.0581828Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2024-04-03T11:40:39.1421381Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '1dff88e5161cba5c59276d2070d2e304e4dcb242' 2024-04-03T11:40:39.1818137Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2024-04-03T11:40:39.1837966Z Submodule 'tools/clang' (https://github.com/wjakob/clang-cindex-python3) registered for path 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T11:40:39.1891823Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11/tools/clang'... 
2024-04-03T11:40:39.9309443Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2024-04-03T11:40:40.0925989Z Submodule path 'third_party/zstd': checked out 'aec56a52fbab207fc639a1937d1e708a282edca8' 2024-04-03T11:40:40.0969022Z ##[debug]0 2024-04-03T11:40:40.0984414Z ##[debug]Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2024-04-03T11:40:40.0987549Z ##[debug]Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2024-04-03T11:40:40.0990604Z ##[debug]Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2024-04-03T11:40:40.0993648Z ##[debug]Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2024-04-03T11:40:40.0996751Z ##[debug]Submodule path 'third_party/QNNPACK': checked out '7d2a4e9931a82adc3814275b6219a03e24e36b4c' 2024-04-03T11:40:40.1000159Z ##[debug]Submodule path 'third_party/VulkanMemoryAllocator': checked out 'a6bfc237255a6bac1513f7c1ebde6d8aed6b5191' 2024-04-03T11:40:40.1003565Z ##[debug]Submodule path 'third_party/XNNPACK': checked out 'fcbf55af6cf28a4627bcd1f703ab7ad843f0f3a2' 2024-04-03T11:40:40.1006765Z ##[debug]Submodule path 'third_party/benchmark': checked out '0d98dba29d66e93259db7daa53a9327df767a415' 2024-04-03T11:40:40.1010043Z ##[debug]Submodule path 'third_party/cpuinfo': checked out 'd6860c477c99f1fce9e28eb206891af3c0e1a1d7' 2024-04-03T11:40:40.1013130Z ##[debug]Submodule path 'third_party/cub': checked out 'd106ddb991a56c3df1b6d51b2409e36ba8181ce4' 2024-04-03T11:40:40.1016325Z ##[debug]Submodule path 'third_party/cudnn_frontend': checked out '150798fe976556078f443fdb059a1ff0361f58a2' 2024-04-03T11:40:40.1019553Z ##[debug]Submodule path 'third_party/cutlass': checked out 'bbe579a9e3beb6ea6626d9227ec32d0dae119a49' 2024-04-03T11:40:40.1023378Z ##[debug]Submodule path 'third_party/eigen': checked out 
'3147391d946bb4b6c68edd901f2add6ac1f31f8c' 2024-04-03T11:40:40.1027387Z ##[debug]Submodule path 'third_party/fbgemm': checked out 'dbc3157bf256f1339b3fa1fef2be89ac4078be0e' 2024-04-03T11:40:40.1030054Z ##[debug]Submodule path 'third_party/fbgemm/third_party/asmjit': checked out 'd3fbf7c9bc7c1d1365a94a45614b91c5a3706b81' 2024-04-03T11:40:40.1032661Z ##[debug]Submodule path 'third_party/fbgemm/third_party/cpuinfo': checked out 'ed8b86a253800bafdb7b25c5c399f91bff9cb1f3' 2024-04-03T11:40:40.1035309Z ##[debug]Submodule path 'third_party/fbgemm/third_party/cutlass': checked out 'fc9ebc645b63f3a6bc80aaefde5c063fb72110d6' 2024-04-03T11:40:40.1037984Z ##[debug]Submodule path 'third_party/fbgemm/third_party/googletest': checked out 'cbf019de22c8dd37b2108da35b2748fd702d1796' 2024-04-03T11:40:40.1040667Z ##[debug]Submodule path 'third_party/fbgemm/third_party/hipify_torch': checked out '23f53b025b466d8ec3c45d52290d3442f7fbe6b1' 2024-04-03T11:40:40.1043120Z ##[debug]Submodule path 'third_party/flatbuffers': checked out '01834de25e4bf3975a9a00e816292b1ad0fe184b' 2024-04-03T11:40:40.1045291Z ##[debug]Submodule path 'third_party/fmt': checked out 'e69e5f977d458f2650bb346dadf2ad30c5320281' 2024-04-03T11:40:40.1047381Z ##[debug]Submodule path 'third_party/foxi': checked out 'c278588e34e535f0bb8f00df3880d26928038cad' 2024-04-03T11:40:40.1049640Z ##[debug]Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2024-04-03T11:40:40.1051886Z ##[debug]Submodule path 'third_party/gloo': checked out '5354032ea08eadd7fc4456477f7f7c6308818509' 2024-04-03T11:40:40.1054047Z ##[debug]Submodule path 'third_party/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2024-04-03T11:40:40.1056216Z ##[debug]Submodule path 'third_party/ideep': checked out '8a6cc4e09dc509f04f83c085e38786b1fb44e14d' 2024-04-03T11:40:40.1058438Z ##[debug]Submodule path 'third_party/ideep/mkl-dnn': checked out '86e6af5974177e513fd3fee58425e1063e7f1361' 
2024-04-03T11:40:40.1060814Z ##[debug]Submodule path 'third_party/ios-cmake': checked out '8abaed637d56f1337d6e1d2c4026e25c1eade724' 2024-04-03T11:40:40.1063007Z ##[debug]Submodule path 'third_party/ittapi': checked out '5b8a7d7422611c3a0d799fb5fc5dd4abfae35b42' 2024-04-03T11:40:40.1065166Z ##[debug]Submodule path 'third_party/kineto': checked out '8466a8b111b36dc725e6855d52a0b133d925a8e0' 2024-04-03T11:40:40.1067656Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out '7d04a0053a845370ae06ce317a22a48e9edcc74e' 2024-04-03T11:40:40.1070709Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2024-04-03T11:40:40.1073947Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2024-04-03T11:40:40.1077181Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2024-04-03T11:40:40.1080449Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2024-04-03T11:40:40.1083782Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2024-04-03T11:40:40.1087108Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2024-04-03T11:40:40.1090421Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '58d77fa8070e8cec2dc1ed015d66b454c8d78850' 2024-04-03T11:40:40.1093737Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 
2024-04-03T11:40:40.1097214Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2024-04-03T11:40:40.1100275Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out 'a33701196adfad74917046096bf5a2aa0ab0bb50' 2024-04-03T11:40:40.1103318Z ##[debug]Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '7aca84427f224eeed3144123d5230d5871e93347' 2024-04-03T11:40:40.1105855Z ##[debug]Submodule path 'third_party/mimalloc': checked out 'b66e3214d8a104669c2ec05ae91ebc26a8f5ab78' 2024-04-03T11:40:40.1108089Z ##[debug]Submodule path 'third_party/nccl/nccl': checked out '48bb7fec7953112ff37499a272317f6663f8f600' 2024-04-03T11:40:40.1110580Z ##[debug]Submodule path 'third_party/neon2sse': checked out '97a126f08ce318023be604d03f88bf0820a9464a' 2024-04-03T11:40:40.1113020Z ##[debug]Submodule path 'third_party/nlohmann': checked out '87cda1d6646592ac5866dc703c8e1839046a6806' 2024-04-03T11:40:40.1115426Z ##[debug]Submodule path 'third_party/onnx': checked out 'ccde5da81388ffa770ca98b64e07f803ad089414' 2024-04-03T11:40:40.1118111Z ##[debug]Submodule path 'third_party/onnx/third_party/benchmark': checked out '2dd015dfef425c866d9a43f2c67d8b52d709acb6' 2024-04-03T11:40:40.1120916Z ##[debug]Submodule path 'third_party/onnx/third_party/pybind11': checked out '5b0a6fc2017fcc176545afe3e09c9f9885283242' 2024-04-03T11:40:40.1123554Z ##[debug]Submodule path 'third_party/onnx-tensorrt': checked out 'c153211418a7c57ce071d9ce2a41f8d1c85a878f' 2024-04-03T11:40:40.1126266Z ##[debug]Submodule path 'third_party/onnx-tensorrt/third_party/onnx': checked out '765f5ee823a67a866f4bd28a9860e81f3c811ce8' 2024-04-03T11:40:40.1129432Z ##[debug]Submodule path 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark': checked out 'e776aa0275e293707b6a0901e0e8d8a8a3679508' 2024-04-03T11:40:40.1132844Z ##[debug]Submodule path 
'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11': checked out 'a1041190c8b8ff0cd9e2f0752248ad5e3789ea0c' 2024-04-03T11:40:40.1136470Z ##[debug]Submodule path 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2024-04-03T11:40:40.1139580Z ##[debug]Submodule path 'third_party/pocketfft': checked out '9d3ab05a7fffbc71a492bc6a17be034e83e8f0fe' 2024-04-03T11:40:40.1142349Z ##[debug]Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2024-04-03T11:40:40.1145118Z ##[debug]Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2024-04-03T11:40:40.1147844Z ##[debug]Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2024-04-03T11:40:40.1149399Z ##[debug]Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2024-04-03T11:40:40.1150800Z ##[debug]Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2024-04-03T11:40:40.1152228Z ##[debug]Submodule path 'third_party/pybind11': checked out '3e9dfa2866941655c56877882565e7577de6fc7b' 2024-04-03T11:40:40.1153678Z ##[debug]Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2024-04-03T11:40:40.1155123Z ##[debug]Submodule path 'third_party/sleef': checked out '60e76d2bce17d278b439d9da17177c8f957a9e9b' 2024-04-03T11:40:40.1156474Z ##[debug]Submodule path 'third_party/tbb': checked out 'a51a90bc609bb73db8ea13841b5cf7aa4344d4a9' 2024-04-03T11:40:40.1157874Z ##[debug]Submodule path 'third_party/tensorpipe': checked out '52791a2fd214b2a9dc5759d36725909c1daa7f2e' 2024-04-03T11:40:40.1159219Z ##[debug]Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2024-04-03T11:40:40.1160683Z 
##[debug]Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2024-04-03T11:40:40.1162102Z ##[debug]Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '1dff88e5161cba5c59276d2070d2e304e4dcb242' 2024-04-03T11:40:40.1163686Z ##[debug]Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2024-04-03T11:40:40.1165351Z ##[debug]Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2024-04-03T11:40:40.1166721Z ##[debug]Submodule path 'third_party/zstd': checked out 'aec56a52fbab207fc639a1937d1e708a282edca8' 2024-04-03T11:40:40.1167506Z ##[debug] 2024-04-03T11:40:40.1168381Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2024-04-03T11:40:40.1488657Z Entering 'android/libs/fbjni' 2024-04-03T11:40:40.1559337Z Entering 'third_party/FP16' 2024-04-03T11:40:40.1611718Z Entering 'third_party/FXdiv' 2024-04-03T11:40:40.1670867Z Entering 'third_party/NNPACK' 2024-04-03T11:40:40.1736806Z Entering 'third_party/QNNPACK' 2024-04-03T11:40:40.1807700Z Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:40.1856828Z Entering 'third_party/XNNPACK' 2024-04-03T11:40:40.1944025Z Entering 'third_party/benchmark' 2024-04-03T11:40:40.2028155Z Entering 'third_party/cpuinfo' 2024-04-03T11:40:40.2096763Z Entering 'third_party/cub' 2024-04-03T11:40:40.2167091Z Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:40.2216685Z Entering 'third_party/cutlass' 2024-04-03T11:40:40.2293070Z Entering 'third_party/eigen' 2024-04-03T11:40:40.2364357Z Entering 'third_party/fbgemm' 2024-04-03T11:40:40.2428299Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:40.2481786Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:40.2534322Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:40.2596314Z Entering 
'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:40.2649257Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:40:40.2702868Z Entering 'third_party/flatbuffers' 2024-04-03T11:40:40.2763869Z Entering 'third_party/fmt' 2024-04-03T11:40:40.2819802Z Entering 'third_party/foxi' 2024-04-03T11:40:40.2875892Z Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:40.2932155Z Entering 'third_party/gloo' 2024-04-03T11:40:40.2989246Z Entering 'third_party/googletest' 2024-04-03T11:40:40.3055048Z Entering 'third_party/ideep' 2024-04-03T11:40:40.3112470Z Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:40.3194119Z Entering 'third_party/ios-cmake' 2024-04-03T11:40:40.3250329Z Entering 'third_party/ittapi' 2024-04-03T11:40:40.3305808Z Entering 'third_party/kineto' 2024-04-03T11:40:40.3361840Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:40.3415279Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:40.3470229Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:40.3522337Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:40.3573956Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:40.3626147Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:40.3683146Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:40.3739165Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:40:40.3791126Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:40.3845269Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:40:40.3899913Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:40.3952777Z Entering 
'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:40.4006929Z Entering 'third_party/mimalloc' 2024-04-03T11:40:40.4063364Z Entering 'third_party/nccl/nccl' 2024-04-03T11:40:40.4119433Z Entering 'third_party/neon2sse' 2024-04-03T11:40:40.4176408Z Entering 'third_party/nlohmann' 2024-04-03T11:40:40.4233613Z Entering 'third_party/onnx' 2024-04-03T11:40:40.4316400Z Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:40.4371975Z Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:40.4432556Z Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:40.4487763Z Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:40.4548505Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:40.4602055Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:40.4655949Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:40.4718267Z Entering 'third_party/pocketfft' 2024-04-03T11:40:40.4774194Z Entering 'third_party/protobuf' 2024-04-03T11:40:40.4835026Z Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:40.4888240Z Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:40.4943512Z Entering 'third_party/psimd' 2024-04-03T11:40:40.5000510Z Entering 'third_party/pthreadpool' 2024-04-03T11:40:40.5055716Z Entering 'third_party/pybind11' 2024-04-03T11:40:40.5112592Z Entering 'third_party/python-peachpy' 2024-04-03T11:40:40.5167644Z Entering 'third_party/sleef' 2024-04-03T11:40:40.5224151Z Entering 'third_party/tbb' 2024-04-03T11:40:40.5282837Z Entering 'third_party/tensorpipe' 2024-04-03T11:40:40.5340203Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:40.5392084Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:40.5443221Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:40.5495063Z Entering 
'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:40.5545810Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T11:40:40.5603059Z Entering 'third_party/zstd' 2024-04-03T11:40:40.5683841Z ##[debug]0 2024-04-03T11:40:40.5692760Z ##[debug]Entering 'android/libs/fbjni' 2024-04-03T11:40:40.5694087Z ##[debug]Entering 'third_party/FP16' 2024-04-03T11:40:40.5695374Z ##[debug]Entering 'third_party/FXdiv' 2024-04-03T11:40:40.5696700Z ##[debug]Entering 'third_party/NNPACK' 2024-04-03T11:40:40.5698012Z ##[debug]Entering 'third_party/QNNPACK' 2024-04-03T11:40:40.5699528Z ##[debug]Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:40.5701283Z ##[debug]Entering 'third_party/XNNPACK' 2024-04-03T11:40:40.5702651Z ##[debug]Entering 'third_party/benchmark' 2024-04-03T11:40:40.5704001Z ##[debug]Entering 'third_party/cpuinfo' 2024-04-03T11:40:40.5705314Z ##[debug]Entering 'third_party/cub' 2024-04-03T11:40:40.5706662Z ##[debug]Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:40.5708070Z ##[debug]Entering 'third_party/cutlass' 2024-04-03T11:40:40.5709375Z ##[debug]Entering 'third_party/eigen' 2024-04-03T11:40:40.5710673Z ##[debug]Entering 'third_party/fbgemm' 2024-04-03T11:40:40.5712192Z ##[debug]Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:40.5714008Z ##[debug]Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:40.5715847Z ##[debug]Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:40.5717726Z ##[debug]Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:40.5719679Z ##[debug]Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:40:40.5721408Z ##[debug]Entering 'third_party/flatbuffers' 2024-04-03T11:40:40.5722787Z ##[debug]Entering 'third_party/fmt' 2024-04-03T11:40:40.5724019Z ##[debug]Entering 'third_party/foxi' 2024-04-03T11:40:40.5725428Z ##[debug]Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:40.5726863Z ##[debug]Entering 
'third_party/gloo' 2024-04-03T11:40:40.5728173Z ##[debug]Entering 'third_party/googletest' 2024-04-03T11:40:40.5729544Z ##[debug]Entering 'third_party/ideep' 2024-04-03T11:40:40.5730908Z ##[debug]Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:40.5732338Z ##[debug]Entering 'third_party/ios-cmake' 2024-04-03T11:40:40.5733669Z ##[debug]Entering 'third_party/ittapi' 2024-04-03T11:40:40.5734964Z ##[debug]Entering 'third_party/kineto' 2024-04-03T11:40:40.5736651Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:40.5739742Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:40.5742752Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:40.5745456Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:40.5748196Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:40.5751040Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:40.5753845Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:40.5756647Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:40:40.5759447Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:40.5762179Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:40:40.5764564Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:40.5766686Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:40.5768522Z ##[debug]Entering 'third_party/mimalloc' 2024-04-03T11:40:40.5769880Z ##[debug]Entering 'third_party/nccl/nccl' 2024-04-03T11:40:40.5771225Z ##[debug]Entering 'third_party/neon2sse' 
2024-04-03T11:40:40.5772572Z ##[debug]Entering 'third_party/nlohmann' 2024-04-03T11:40:40.5773877Z ##[debug]Entering 'third_party/onnx' 2024-04-03T11:40:40.5775387Z ##[debug]Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:40.5777193Z ##[debug]Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:40.5778847Z ##[debug]Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:40.5780965Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:40.5783286Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:40.5785957Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:40.5787611Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:40.5788582Z ##[debug]Entering 'third_party/pocketfft' 2024-04-03T11:40:40.5789184Z ##[debug]Entering 'third_party/protobuf' 2024-04-03T11:40:40.5789903Z ##[debug]Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:40.5790761Z ##[debug]Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:40.5791497Z ##[debug]Entering 'third_party/psimd' 2024-04-03T11:40:40.5792091Z ##[debug]Entering 'third_party/pthreadpool' 2024-04-03T11:40:40.5792703Z ##[debug]Entering 'third_party/pybind11' 2024-04-03T11:40:40.5793325Z ##[debug]Entering 'third_party/python-peachpy' 2024-04-03T11:40:40.5793946Z ##[debug]Entering 'third_party/sleef' 2024-04-03T11:40:40.5794505Z ##[debug]Entering 'third_party/tbb' 2024-04-03T11:40:40.5795073Z ##[debug]Entering 'third_party/tensorpipe' 2024-04-03T11:40:40.5795828Z ##[debug]Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:40.5796714Z ##[debug]Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:40.5797468Z ##[debug]Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:40.5798186Z ##[debug]Entering 
'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:40.5798989Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T11:40:40.5799667Z ##[debug]Entering 'third_party/zstd' 2024-04-03T11:40:40.5800107Z ##[debug] 2024-04-03T11:40:40.5800731Z ::endgroup:: 2024-04-03T11:40:40.5801072Z ##[endgroup] 2024-04-03T11:40:40.5801701Z ::group::Persisting credentials for submodules 2024-04-03T11:40:40.5802272Z ##[group]Persisting credentials for submodules 2024-04-03T11:40:40.5804301Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2024-04-03T11:40:40.6187586Z Entering 'android/libs/fbjni' 2024-04-03T11:40:40.6272004Z Entering 'third_party/FP16' 2024-04-03T11:40:40.6324107Z Entering 'third_party/FXdiv' 2024-04-03T11:40:40.6372184Z Entering 'third_party/NNPACK' 2024-04-03T11:40:40.6437601Z Entering 'third_party/QNNPACK' 2024-04-03T11:40:40.6504642Z Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:40.6570391Z Entering 'third_party/XNNPACK' 2024-04-03T11:40:40.6678554Z Entering 'third_party/benchmark' 2024-04-03T11:40:40.6757902Z Entering 'third_party/cpuinfo' 2024-04-03T11:40:40.6826364Z Entering 'third_party/cub' 2024-04-03T11:40:40.6894152Z Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:40.6960842Z Entering 'third_party/cutlass' 2024-04-03T11:40:40.7041557Z Entering 'third_party/eigen' 2024-04-03T11:40:40.7097906Z Entering 'third_party/fbgemm' 2024-04-03T11:40:40.7141202Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:40.7195748Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:40.7253511Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:40.7320072Z Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:40.7377302Z Entering 'third_party/fbgemm/third_party/hipify_torch' 
2024-04-03T11:40:40.7419386Z Entering 'third_party/flatbuffers' 2024-04-03T11:40:40.7480374Z Entering 'third_party/fmt' 2024-04-03T11:40:40.7532513Z Entering 'third_party/foxi' 2024-04-03T11:40:40.7573809Z Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:40.7630841Z Entering 'third_party/gloo' 2024-04-03T11:40:40.7692144Z Entering 'third_party/googletest' 2024-04-03T11:40:40.7759286Z Entering 'third_party/ideep' 2024-04-03T11:40:40.7812205Z Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:40.7882680Z Entering 'third_party/ios-cmake' 2024-04-03T11:40:40.7945044Z Entering 'third_party/ittapi' 2024-04-03T11:40:40.7991238Z Entering 'third_party/kineto' 2024-04-03T11:40:40.8035853Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:40.8095133Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:40.8155701Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:40.8210746Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:40.8272819Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:40.8319246Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:40.8382607Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:40.8444536Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:40:40.8508967Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:40.8573373Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:40:40.8634413Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:40.8699934Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:40.8780357Z Entering 'third_party/mimalloc' 2024-04-03T11:40:40.8849800Z 
Entering 'third_party/nccl/nccl' 2024-04-03T11:40:40.8932873Z Entering 'third_party/neon2sse' 2024-04-03T11:40:40.9005564Z Entering 'third_party/nlohmann' 2024-04-03T11:40:40.9077962Z Entering 'third_party/onnx' 2024-04-03T11:40:40.9162951Z Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:40.9237962Z Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:40.9334663Z Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:40.9422231Z Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:40.9492422Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:40.9553754Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:40.9626702Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:40.9709408Z Entering 'third_party/pocketfft' 2024-04-03T11:40:40.9800010Z Entering 'third_party/protobuf' 2024-04-03T11:40:40.9853846Z Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:40.9925554Z Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:40.9995292Z Entering 'third_party/psimd' 2024-04-03T11:40:41.0072953Z Entering 'third_party/pthreadpool' 2024-04-03T11:40:41.0124709Z Entering 'third_party/pybind11' 2024-04-03T11:40:41.0167739Z Entering 'third_party/python-peachpy' 2024-04-03T11:40:41.0218547Z Entering 'third_party/sleef' 2024-04-03T11:40:41.0280591Z Entering 'third_party/tbb' 2024-04-03T11:40:41.0338276Z Entering 'third_party/tensorpipe' 2024-04-03T11:40:41.0416703Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:41.0503460Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:41.0590093Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:41.0647761Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:41.0694749Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 
2024-04-03T11:40:41.0762388Z Entering 'third_party/zstd' 2024-04-03T11:40:41.0856424Z ##[debug]0 2024-04-03T11:40:41.0861989Z ##[debug]Entering 'android/libs/fbjni' 2024-04-03T11:40:41.0863311Z ##[debug]Entering 'third_party/FP16' 2024-04-03T11:40:41.0864854Z ##[debug]Entering 'third_party/FXdiv' 2024-04-03T11:40:41.0866180Z ##[debug]Entering 'third_party/NNPACK' 2024-04-03T11:40:41.0867515Z ##[debug]Entering 'third_party/QNNPACK' 2024-04-03T11:40:41.0869060Z ##[debug]Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:41.0870601Z ##[debug]Entering 'third_party/XNNPACK' 2024-04-03T11:40:41.0871966Z ##[debug]Entering 'third_party/benchmark' 2024-04-03T11:40:41.0873317Z ##[debug]Entering 'third_party/cpuinfo' 2024-04-03T11:40:41.0874622Z ##[debug]Entering 'third_party/cub' 2024-04-03T11:40:41.0876011Z ##[debug]Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:41.0877452Z ##[debug]Entering 'third_party/cutlass' 2024-04-03T11:40:41.0878747Z ##[debug]Entering 'third_party/eigen' 2024-04-03T11:40:41.0880076Z ##[debug]Entering 'third_party/fbgemm' 2024-04-03T11:40:41.0881605Z ##[debug]Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:41.0883395Z ##[debug]Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:41.0885235Z ##[debug]Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:41.0887140Z ##[debug]Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:41.0889109Z ##[debug]Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:40:41.0890818Z ##[debug]Entering 'third_party/flatbuffers' 2024-04-03T11:40:41.0892200Z ##[debug]Entering 'third_party/fmt' 2024-04-03T11:40:41.0893460Z ##[debug]Entering 'third_party/foxi' 2024-04-03T11:40:41.0894866Z ##[debug]Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:41.0896338Z ##[debug]Entering 'third_party/gloo' 2024-04-03T11:40:41.0897681Z ##[debug]Entering 'third_party/googletest' 2024-04-03T11:40:41.0899030Z ##[debug]Entering 
'third_party/ideep' 2024-04-03T11:40:41.0900532Z ##[debug]Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:41.0901978Z ##[debug]Entering 'third_party/ios-cmake' 2024-04-03T11:40:41.0903316Z ##[debug]Entering 'third_party/ittapi' 2024-04-03T11:40:41.0904635Z ##[debug]Entering 'third_party/kineto' 2024-04-03T11:40:41.0906340Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:41.0908759Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:41.0911459Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:41.0914183Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:41.0916931Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:41.0920366Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:41.0923272Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:41.0926366Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:40:41.0929155Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:41.0931842Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:40:41.0934241Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:41.0936394Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:41.0938202Z ##[debug]Entering 'third_party/mimalloc' 2024-04-03T11:40:41.0939557Z ##[debug]Entering 'third_party/nccl/nccl' 2024-04-03T11:40:41.0941304Z ##[debug]Entering 'third_party/neon2sse' 2024-04-03T11:40:41.0942640Z ##[debug]Entering 'third_party/nlohmann' 2024-04-03T11:40:41.0943993Z ##[debug]Entering 'third_party/onnx' 
2024-04-03T11:40:41.0945524Z ##[debug]Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:41.0946969Z ##[debug]Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:41.0947713Z ##[debug]Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:41.0948463Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:41.0949468Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:41.0950613Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:41.0951848Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:41.0952817Z ##[debug]Entering 'third_party/pocketfft' 2024-04-03T11:40:41.0953412Z ##[debug]Entering 'third_party/protobuf' 2024-04-03T11:40:41.0954134Z ##[debug]Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:41.0955011Z ##[debug]Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:41.0955747Z ##[debug]Entering 'third_party/psimd' 2024-04-03T11:40:41.0956343Z ##[debug]Entering 'third_party/pthreadpool' 2024-04-03T11:40:41.0956954Z ##[debug]Entering 'third_party/pybind11' 2024-04-03T11:40:41.0957486Z ##[debug]Entering 'third_party/python-peachpy' 2024-04-03T11:40:41.0958011Z ##[debug]Entering 'third_party/sleef' 2024-04-03T11:40:41.0958486Z ##[debug]Entering 'third_party/tbb' 2024-04-03T11:40:41.0958975Z ##[debug]Entering 'third_party/tensorpipe' 2024-04-03T11:40:41.0959605Z ##[debug]Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:41.0960344Z ##[debug]Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:41.0961059Z ##[debug]Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:41.0961774Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:41.0962583Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 
2024-04-03T11:40:41.0963273Z ##[debug]Entering 'third_party/zstd' 2024-04-03T11:40:41.0963708Z ##[debug] 2024-04-03T11:40:41.0965396Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2024-04-03T11:40:41.1370979Z Entering 'android/libs/fbjni' 2024-04-03T11:40:41.1462635Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2024-04-03T11:40:41.1489043Z Entering 'third_party/FP16' 2024-04-03T11:40:41.1558828Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2024-04-03T11:40:41.1590591Z Entering 'third_party/FXdiv' 2024-04-03T11:40:41.1650793Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2024-04-03T11:40:41.1677554Z Entering 'third_party/NNPACK' 2024-04-03T11:40:41.1731392Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2024-04-03T11:40:41.1752224Z Entering 'third_party/QNNPACK' 2024-04-03T11:40:41.1794647Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/QNNPACK/config remote.origin.url 2024-04-03T11:40:41.1816706Z Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:41.1871880Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2024-04-03T11:40:41.1899288Z Entering 'third_party/XNNPACK' 2024-04-03T11:40:41.1940562Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2024-04-03T11:40:41.1975818Z Entering 'third_party/benchmark' 2024-04-03T11:40:41.2027348Z 
file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2024-04-03T11:40:41.2054085Z Entering 'third_party/cpuinfo' 2024-04-03T11:40:41.2099093Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2024-04-03T11:40:41.2121667Z Entering 'third_party/cub' 2024-04-03T11:40:41.2160594Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cub/config remote.origin.url 2024-04-03T11:40:41.2187518Z Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:41.2241976Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2024-04-03T11:40:41.2260792Z Entering 'third_party/cutlass' 2024-04-03T11:40:41.2296814Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2024-04-03T11:40:41.2323390Z Entering 'third_party/eigen' 2024-04-03T11:40:41.2376692Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/eigen/config remote.origin.url 2024-04-03T11:40:41.2408733Z Entering 'third_party/fbgemm' 2024-04-03T11:40:41.2448843Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2024-04-03T11:40:41.2466444Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:41.2518781Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/asmjit/config remote.origin.url 2024-04-03T11:40:41.2543449Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:41.2585071Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cpuinfo/config remote.origin.url 2024-04-03T11:40:41.2601232Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:41.2637704Z 
file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cutlass/config remote.origin.url 2024-04-03T11:40:41.2667303Z Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:41.2720102Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.2738961Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:40:41.2772319Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/hipify_torch/config remote.origin.url 2024-04-03T11:40:41.2789482Z Entering 'third_party/flatbuffers' 2024-04-03T11:40:41.2839051Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2024-04-03T11:40:41.2870411Z Entering 'third_party/fmt' 2024-04-03T11:40:41.2925982Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2024-04-03T11:40:41.2949561Z Entering 'third_party/foxi' 2024-04-03T11:40:41.2988999Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/foxi/config remote.origin.url 2024-04-03T11:40:41.3006383Z Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:41.3055117Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2024-04-03T11:40:41.3081191Z Entering 'third_party/gloo' 2024-04-03T11:40:41.3126411Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2024-04-03T11:40:41.3145970Z Entering 'third_party/googletest' 2024-04-03T11:40:41.3185006Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.3210327Z Entering 'third_party/ideep' 2024-04-03T11:40:41.3263001Z 
file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2024-04-03T11:40:41.3284011Z Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:41.3349432Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2024-04-03T11:40:41.3385968Z Entering 'third_party/ios-cmake' 2024-04-03T11:40:41.3438812Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ios-cmake/config remote.origin.url 2024-04-03T11:40:41.3464862Z Entering 'third_party/ittapi' 2024-04-03T11:40:41.3516631Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2024-04-03T11:40:41.3540914Z Entering 'third_party/kineto' 2024-04-03T11:40:41.3580781Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2024-04-03T11:40:41.3599688Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:41.3653402Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2024-04-03T11:40:41.3677602Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:41.3741348Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2024-04-03T11:40:41.3773989Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:41.3814926Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2024-04-03T11:40:41.3832499Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:41.3869915Z 
file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2024-04-03T11:40:41.3894002Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:41.3950131Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2024-04-03T11:40:41.3968130Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:41.4038121Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2024-04-03T11:40:41.4074406Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:41.4129956Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2024-04-03T11:40:41.4154335Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:40:41.4206134Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.4231475Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:41.4270016Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2024-04-03T11:40:41.4288060Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:40:41.4327593Z 
file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2024-04-03T11:40:41.4356810Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:41.4408551Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2024-04-03T11:40:41.4433582Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:41.4478887Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.4498744Z Entering 'third_party/mimalloc' 2024-04-03T11:40:41.4564618Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2024-04-03T11:40:41.4595113Z Entering 'third_party/nccl/nccl' 2024-04-03T11:40:41.4647804Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nccl/nccl/config remote.origin.url 2024-04-03T11:40:41.4669591Z Entering 'third_party/neon2sse' 2024-04-03T11:40:41.4709263Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/neon2sse/config remote.origin.url 2024-04-03T11:40:41.4728528Z Entering 'third_party/nlohmann' 2024-04-03T11:40:41.4770250Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2024-04-03T11:40:41.4799890Z Entering 'third_party/onnx' 2024-04-03T11:40:41.4845681Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2024-04-03T11:40:41.4876742Z Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:41.4928167Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/benchmark/config remote.origin.url 
2024-04-03T11:40:41.4954386Z Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:41.5002042Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2024-04-03T11:40:41.5023754Z Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:41.5091836Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/config remote.origin.url 2024-04-03T11:40:41.5113943Z Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:41.5162410Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/config remote.origin.url 2024-04-03T11:40:41.5194330Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:41.5249101Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/modules/third_party/benchmark/config remote.origin.url 2024-04-03T11:40:41.5275016Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:41.5328415Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2024-04-03T11:40:41.5353218Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:41.5406464Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2024-04-03T11:40:41.5436970Z Entering 'third_party/pocketfft' 2024-04-03T11:40:41.5491085Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2024-04-03T11:40:41.5518096Z Entering 'third_party/protobuf' 2024-04-03T11:40:41.5555149Z 
file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2024-04-03T11:40:41.5574895Z Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:41.5624437Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2024-04-03T11:40:41.5649671Z Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:41.5700104Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.5720487Z Entering 'third_party/psimd' 2024-04-03T11:40:41.5787316Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2024-04-03T11:40:41.5807739Z Entering 'third_party/pthreadpool' 2024-04-03T11:40:41.5842636Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2024-04-03T11:40:41.5859379Z Entering 'third_party/pybind11' 2024-04-03T11:40:41.5912152Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2024-04-03T11:40:41.5942275Z Entering 'third_party/python-peachpy' 2024-04-03T11:40:41.5984344Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2024-04-03T11:40:41.6001193Z Entering 'third_party/sleef' 2024-04-03T11:40:41.6035311Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2024-04-03T11:40:41.6056884Z Entering 'third_party/tbb' 2024-04-03T11:40:41.6113928Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tbb/config remote.origin.url 2024-04-03T11:40:41.6137129Z Entering 'third_party/tensorpipe' 2024-04-03T11:40:41.6176651Z 
file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2024-04-03T11:40:41.6196005Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:41.6249741Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.6275186Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:41.6321889Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2024-04-03T11:40:41.6336194Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:41.6368887Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2024-04-03T11:40:41.6384312Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:41.6417395Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2024-04-03T11:40:41.6433785Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T11:40:41.6494415Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2024-04-03T11:40:41.6525457Z Entering 'third_party/zstd' 2024-04-03T11:40:41.6569740Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/zstd/config remote.origin.url 2024-04-03T11:40:41.6611272Z ##[debug]0 2024-04-03T11:40:41.6626854Z ##[debug]Entering 'android/libs/fbjni' 2024-04-03T11:40:41.6629552Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2024-04-03T11:40:41.6632933Z ##[debug]Entering 'third_party/FP16' 
2024-04-03T11:40:41.6635726Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2024-04-03T11:40:41.6638766Z ##[debug]Entering 'third_party/FXdiv' 2024-04-03T11:40:41.6641807Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2024-04-03T11:40:41.6644677Z ##[debug]Entering 'third_party/NNPACK' 2024-04-03T11:40:41.6647431Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2024-04-03T11:40:41.6650124Z ##[debug]Entering 'third_party/QNNPACK' 2024-04-03T11:40:41.6653110Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/QNNPACK/config remote.origin.url 2024-04-03T11:40:41.6656306Z ##[debug]Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:41.6660490Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2024-04-03T11:40:41.6663508Z ##[debug]Entering 'third_party/XNNPACK' 2024-04-03T11:40:41.6666163Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2024-04-03T11:40:41.6668447Z ##[debug]Entering 'third_party/benchmark' 2024-04-03T11:40:41.6670405Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2024-04-03T11:40:41.6672381Z ##[debug]Entering 'third_party/cpuinfo' 2024-04-03T11:40:41.6674333Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2024-04-03T11:40:41.6676258Z ##[debug]Entering 'third_party/cub' 2024-04-03T11:40:41.6678077Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cub/config remote.origin.url 
2024-04-03T11:40:41.6680041Z ##[debug]Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:41.6682108Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2024-04-03T11:40:41.6684144Z ##[debug]Entering 'third_party/cutlass' 2024-04-03T11:40:41.6686047Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2024-04-03T11:40:41.6688003Z ##[debug]Entering 'third_party/eigen' 2024-04-03T11:40:41.6689870Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/eigen/config remote.origin.url 2024-04-03T11:40:41.6691792Z ##[debug]Entering 'third_party/fbgemm' 2024-04-03T11:40:41.6693698Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2024-04-03T11:40:41.6695813Z ##[debug]Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:41.6698259Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/asmjit/config remote.origin.url 2024-04-03T11:40:41.6700955Z ##[debug]Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:41.6703428Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cpuinfo/config remote.origin.url 2024-04-03T11:40:41.6705900Z ##[debug]Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:41.6708521Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cutlass/config remote.origin.url 2024-04-03T11:40:41.6711213Z ##[debug]Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:41.6714251Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/googletest/config remote.origin.url 
2024-04-03T11:40:41.6717222Z ##[debug]Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:40:41.6720021Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/hipify_torch/config remote.origin.url 2024-04-03T11:40:41.6722962Z ##[debug]Entering 'third_party/flatbuffers' 2024-04-03T11:40:41.6725351Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2024-04-03T11:40:41.6727629Z ##[debug]Entering 'third_party/fmt' 2024-04-03T11:40:41.6729796Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2024-04-03T11:40:41.6731980Z ##[debug]Entering 'third_party/foxi' 2024-04-03T11:40:41.6734115Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/foxi/config remote.origin.url 2024-04-03T11:40:41.6736882Z ##[debug]Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:41.6739443Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2024-04-03T11:40:41.6742326Z ##[debug]Entering 'third_party/gloo' 2024-04-03T11:40:41.6744382Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2024-04-03T11:40:41.6746492Z ##[debug]Entering 'third_party/googletest' 2024-04-03T11:40:41.6748732Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.6751265Z ##[debug]Entering 'third_party/ideep' 2024-04-03T11:40:41.6753674Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2024-04-03T11:40:41.6756501Z ##[debug]Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:41.6759244Z 
##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2024-04-03T11:40:41.6762208Z ##[debug]Entering 'third_party/ios-cmake' 2024-04-03T11:40:41.6764946Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ios-cmake/config remote.origin.url 2024-04-03T11:40:41.6767640Z ##[debug]Entering 'third_party/ittapi' 2024-04-03T11:40:41.6770008Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2024-04-03T11:40:41.6772491Z ##[debug]Entering 'third_party/kineto' 2024-04-03T11:40:41.6775189Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2024-04-03T11:40:41.6778176Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:41.6782186Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2024-04-03T11:40:41.6786294Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:41.6789080Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2024-04-03T11:40:41.6791462Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:41.6793800Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2024-04-03T11:40:41.6796190Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:41.6798751Z 
##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2024-04-03T11:40:41.6801061Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:41.6803576Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2024-04-03T11:40:41.6806080Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:41.6808619Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2024-04-03T11:40:41.6811116Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:41.6813503Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2024-04-03T11:40:41.6815886Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:40:41.6818494Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.6820846Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:41.6823208Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2024-04-03T11:40:41.6825583Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 
2024-04-03T11:40:41.6827949Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2024-04-03T11:40:41.6830086Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:41.6832052Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2024-04-03T11:40:41.6834021Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:41.6835908Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.6837803Z ##[debug]Entering 'third_party/mimalloc' 2024-04-03T11:40:41.6839276Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2024-04-03T11:40:41.6840764Z ##[debug]Entering 'third_party/nccl/nccl' 2024-04-03T11:40:41.6842198Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nccl/nccl/config remote.origin.url 2024-04-03T11:40:41.6843703Z ##[debug]Entering 'third_party/neon2sse' 2024-04-03T11:40:41.6845109Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/neon2sse/config remote.origin.url 2024-04-03T11:40:41.6846624Z ##[debug]Entering 'third_party/nlohmann' 2024-04-03T11:40:41.6848055Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2024-04-03T11:40:41.6849447Z ##[debug]Entering 'third_party/onnx' 2024-04-03T11:40:41.6850861Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2024-04-03T11:40:41.6852382Z ##[debug]Entering 'third_party/onnx/third_party/benchmark' 
2024-04-03T11:40:41.6854277Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/benchmark/config remote.origin.url 2024-04-03T11:40:41.6856304Z ##[debug]Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:41.6858046Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2024-04-03T11:40:41.6860021Z ##[debug]Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:41.6861573Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/config remote.origin.url 2024-04-03T11:40:41.6863165Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:41.6865096Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/config remote.origin.url 2024-04-03T11:40:41.6867176Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:41.6869531Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/modules/third_party/benchmark/config remote.origin.url 2024-04-03T11:40:41.6871849Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:41.6874321Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2024-04-03T11:40:41.6876604Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:41.6879192Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 
2024-04-03T11:40:41.6881358Z ##[debug]Entering 'third_party/pocketfft' 2024-04-03T11:40:41.6882807Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2024-04-03T11:40:41.6884269Z ##[debug]Entering 'third_party/protobuf' 2024-04-03T11:40:41.6885573Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2024-04-03T11:40:41.6887028Z ##[debug]Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:41.6888925Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2024-04-03T11:40:41.6890692Z ##[debug]Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:41.6892591Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.6894333Z ##[debug]Entering 'third_party/psimd' 2024-04-03T11:40:41.6895790Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2024-04-03T11:40:41.6897394Z ##[debug]Entering 'third_party/pthreadpool' 2024-04-03T11:40:41.6898980Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2024-04-03T11:40:41.6900842Z ##[debug]Entering 'third_party/pybind11' 2024-04-03T11:40:41.6902288Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2024-04-03T11:40:41.6903783Z ##[debug]Entering 'third_party/python-peachpy' 2024-04-03T11:40:41.6905255Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2024-04-03T11:40:41.6906719Z ##[debug]Entering 
'third_party/sleef' 2024-04-03T11:40:41.6908287Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2024-04-03T11:40:41.6909644Z ##[debug]Entering 'third_party/tbb' 2024-04-03T11:40:41.6911161Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tbb/config remote.origin.url 2024-04-03T11:40:41.6912663Z ##[debug]Entering 'third_party/tensorpipe' 2024-04-03T11:40:41.6914400Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2024-04-03T11:40:41.6916096Z ##[debug]Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:41.6918033Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2024-04-03T11:40:41.6919877Z ##[debug]Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:41.6921813Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2024-04-03T11:40:41.6923597Z ##[debug]Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:41.6925447Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2024-04-03T11:40:41.6927373Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:41.6929297Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2024-04-03T11:40:41.6931327Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T11:40:41.6933432Z 
##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2024-04-03T11:40:41.6935444Z ##[debug]Entering 'third_party/zstd' 2024-04-03T11:40:41.6936766Z ##[debug]file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/zstd/config remote.origin.url 2024-04-03T11:40:41.6938014Z ##[debug] 2024-04-03T11:40:41.6940297Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config' 2024-04-03T11:40:41.6943345Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config' 2024-04-03T11:40:41.6946293Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config' 2024-04-03T11:40:41.6949315Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config' 2024-04-03T11:40:41.6951853Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/QNNPACK/config' 2024-04-03T11:40:41.6954459Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config' 2024-04-03T11:40:41.6957045Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config' 2024-04-03T11:40:41.6959521Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config' 2024-04-03T11:40:41.6962005Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config' 2024-04-03T11:40:41.6964426Z ##[debug]Replacing token 
placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cub/config' 2024-04-03T11:40:41.6966934Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config' 2024-04-03T11:40:41.6969461Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config' 2024-04-03T11:40:41.6972178Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/eigen/config' 2024-04-03T11:40:41.6974655Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config' 2024-04-03T11:40:41.6977525Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/asmjit/config' 2024-04-03T11:40:41.6980639Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cpuinfo/config' 2024-04-03T11:40:41.6983564Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cutlass/config' 2024-04-03T11:40:41.6986493Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/googletest/config' 2024-04-03T11:40:41.6989492Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/hipify_torch/config' 2024-04-03T11:40:41.6992274Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config' 2024-04-03T11:40:41.6994734Z ##[debug]Replacing token placeholder in 
'/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config' 2024-04-03T11:40:41.6997137Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/foxi/config' 2024-04-03T11:40:41.6999666Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config' 2024-04-03T11:40:41.7002176Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config' 2024-04-03T11:40:41.7004644Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config' 2024-04-03T11:40:41.7007137Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config' 2024-04-03T11:40:41.7009706Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config' 2024-04-03T11:40:41.7012306Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ios-cmake/config' 2024-04-03T11:40:41.7014781Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config' 2024-04-03T11:40:41.7017224Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config' 2024-04-03T11:40:41.7020209Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config' 2024-04-03T11:40:41.7023551Z ##[debug]Replacing token placeholder in 
'/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config' 2024-04-03T11:40:41.7027039Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config' 2024-04-03T11:40:41.7030517Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config' 2024-04-03T11:40:41.7034028Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config' 2024-04-03T11:40:41.7040265Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config' 2024-04-03T11:40:41.7044118Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config' 2024-04-03T11:40:41.7047670Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config' 2024-04-03T11:40:41.7051123Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config' 2024-04-03T11:40:41.7054705Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config' 
2024-04-03T11:40:41.7058006Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config' 2024-04-03T11:40:41.7060765Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config' 2024-04-03T11:40:41.7063628Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config' 2024-04-03T11:40:41.7066151Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nccl/nccl/config' 2024-04-03T11:40:41.7068684Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/neon2sse/config' 2024-04-03T11:40:41.7071196Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config' 2024-04-03T11:40:41.7073670Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config' 2024-04-03T11:40:41.7076364Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/benchmark/config' 2024-04-03T11:40:41.7079114Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config' 2024-04-03T11:40:41.7081407Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/config' 2024-04-03T11:40:41.7083758Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/config' 2024-04-03T11:40:41.7086483Z 
##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/modules/third_party/benchmark/config' 2024-04-03T11:40:41.7089414Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/modules/third_party/pybind11/config' 2024-04-03T11:40:41.7092486Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx-tensorrt/modules/third_party/onnx/modules/third_party/pybind11/modules/tools/clang/config' 2024-04-03T11:40:41.7095119Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config' 2024-04-03T11:40:41.7097362Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config' 2024-04-03T11:40:41.7099515Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config' 2024-04-03T11:40:41.7102606Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config' 2024-04-03T11:40:41.7105236Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config' 2024-04-03T11:40:41.7107757Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config' 2024-04-03T11:40:41.7110201Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config' 2024-04-03T11:40:41.7112586Z ##[debug]Replacing token placeholder in 
'/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config' 2024-04-03T11:40:41.7114927Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config' 2024-04-03T11:40:41.7117180Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tbb/config' 2024-04-03T11:40:41.7119272Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config' 2024-04-03T11:40:41.7121432Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config' 2024-04-03T11:40:41.7123761Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config' 2024-04-03T11:40:41.7126051Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config' 2024-04-03T11:40:41.7128372Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config' 2024-04-03T11:40:41.7130826Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config' 2024-04-03T11:40:41.7133041Z ##[debug]Replacing token placeholder in '/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/zstd/config' 2024-04-03T11:40:41.7134793Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2024-04-03T11:40:41.7449961Z Entering 'android/libs/fbjni' 2024-04-03T11:40:41.7529982Z 
Entering 'third_party/FP16' 2024-04-03T11:40:41.7620248Z Entering 'third_party/FXdiv' 2024-04-03T11:40:41.7703702Z Entering 'third_party/NNPACK' 2024-04-03T11:40:41.7766071Z Entering 'third_party/QNNPACK' 2024-04-03T11:40:41.7828737Z Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:41.7899614Z Entering 'third_party/XNNPACK' 2024-04-03T11:40:41.7982783Z Entering 'third_party/benchmark' 2024-04-03T11:40:41.8041114Z Entering 'third_party/cpuinfo' 2024-04-03T11:40:41.8099023Z Entering 'third_party/cub' 2024-04-03T11:40:41.8160493Z Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:41.8217580Z Entering 'third_party/cutlass' 2024-04-03T11:40:41.8290046Z Entering 'third_party/eigen' 2024-04-03T11:40:41.8352225Z Entering 'third_party/fbgemm' 2024-04-03T11:40:41.8415686Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:41.8480941Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:41.8539812Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:41.8603988Z Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:41.8656495Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:40:41.8712808Z Entering 'third_party/flatbuffers' 2024-04-03T11:40:41.8783065Z Entering 'third_party/fmt' 2024-04-03T11:40:41.8850016Z Entering 'third_party/foxi' 2024-04-03T11:40:41.8921937Z Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:41.8981384Z Entering 'third_party/gloo' 2024-04-03T11:40:41.9037948Z Entering 'third_party/googletest' 2024-04-03T11:40:41.9094983Z Entering 'third_party/ideep' 2024-04-03T11:40:41.9150646Z Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:41.9237065Z Entering 'third_party/ios-cmake' 2024-04-03T11:40:41.9302423Z Entering 'third_party/ittapi' 2024-04-03T11:40:41.9369655Z Entering 'third_party/kineto' 2024-04-03T11:40:41.9455209Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:41.9534578Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:41.9618852Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:41.9681928Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:41.9743114Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:41.9795130Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:41.9852752Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:41.9904796Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:40:41.9957816Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:42.0012130Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:40:42.0068872Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:42.0125082Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:42.0194542Z Entering 'third_party/mimalloc' 2024-04-03T11:40:42.0251408Z Entering 'third_party/nccl/nccl' 2024-04-03T11:40:42.0309246Z Entering 'third_party/neon2sse' 2024-04-03T11:40:42.0364868Z Entering 'third_party/nlohmann' 2024-04-03T11:40:42.0422824Z Entering 'third_party/onnx' 2024-04-03T11:40:42.0503883Z Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:42.0562990Z Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:42.0624210Z Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:42.0680219Z Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:42.0741563Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:42.0797000Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:42.0857187Z Entering 
'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:42.0934190Z Entering 'third_party/pocketfft' 2024-04-03T11:40:42.0994876Z Entering 'third_party/protobuf' 2024-04-03T11:40:42.1057989Z Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:42.1111657Z Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:42.1168765Z Entering 'third_party/psimd' 2024-04-03T11:40:42.1235795Z Entering 'third_party/pthreadpool' 2024-04-03T11:40:42.1295986Z Entering 'third_party/pybind11' 2024-04-03T11:40:42.1355573Z Entering 'third_party/python-peachpy' 2024-04-03T11:40:42.1412598Z Entering 'third_party/sleef' 2024-04-03T11:40:42.1480266Z Entering 'third_party/tbb' 2024-04-03T11:40:42.1549904Z Entering 'third_party/tensorpipe' 2024-04-03T11:40:42.1616313Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:42.1680349Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:42.1722962Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:42.1769629Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:42.1830913Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T11:40:42.1903039Z Entering 'third_party/zstd' 2024-04-03T11:40:42.1995319Z ##[debug]0 2024-04-03T11:40:42.2000515Z ##[debug]Entering 'android/libs/fbjni' 2024-04-03T11:40:42.2002302Z ##[debug]Entering 'third_party/FP16' 2024-04-03T11:40:42.2003599Z ##[debug]Entering 'third_party/FXdiv' 2024-04-03T11:40:42.2004895Z ##[debug]Entering 'third_party/NNPACK' 2024-04-03T11:40:42.2006203Z ##[debug]Entering 'third_party/QNNPACK' 2024-04-03T11:40:42.2007737Z ##[debug]Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:42.2009299Z ##[debug]Entering 'third_party/XNNPACK' 2024-04-03T11:40:42.2010645Z ##[debug]Entering 'third_party/benchmark' 2024-04-03T11:40:42.2012017Z ##[debug]Entering 'third_party/cpuinfo' 2024-04-03T11:40:42.2013312Z ##[debug]Entering 
'third_party/cub' 2024-04-03T11:40:42.2014659Z ##[debug]Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:42.2016074Z ##[debug]Entering 'third_party/cutlass' 2024-04-03T11:40:42.2017370Z ##[debug]Entering 'third_party/eigen' 2024-04-03T11:40:42.2018689Z ##[debug]Entering 'third_party/fbgemm' 2024-04-03T11:40:42.2020347Z ##[debug]Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:42.2022166Z ##[debug]Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:42.2023980Z ##[debug]Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:42.2025875Z ##[debug]Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:42.2027853Z ##[debug]Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:40:42.2029561Z ##[debug]Entering 'third_party/flatbuffers' 2024-04-03T11:40:42.2030916Z ##[debug]Entering 'third_party/fmt' 2024-04-03T11:40:42.2032162Z ##[debug]Entering 'third_party/foxi' 2024-04-03T11:40:42.2033559Z ##[debug]Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:42.2035014Z ##[debug]Entering 'third_party/gloo' 2024-04-03T11:40:42.2036340Z ##[debug]Entering 'third_party/googletest' 2024-04-03T11:40:42.2037696Z ##[debug]Entering 'third_party/ideep' 2024-04-03T11:40:42.2039107Z ##[debug]Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:42.2040548Z ##[debug]Entering 'third_party/ios-cmake' 2024-04-03T11:40:42.2041892Z ##[debug]Entering 'third_party/ittapi' 2024-04-03T11:40:42.2043211Z ##[debug]Entering 'third_party/kineto' 2024-04-03T11:40:42.2044924Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:42.2047378Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:42.2050069Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:42.2052777Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:42.2055507Z 
##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:42.2058342Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:42.2061303Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:42.2064136Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:40:42.2066948Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:42.2069653Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:40:42.2072042Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:42.2074170Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:42.2075981Z ##[debug]Entering 'third_party/mimalloc' 2024-04-03T11:40:42.2077358Z ##[debug]Entering 'third_party/nccl/nccl' 2024-04-03T11:40:42.2078736Z ##[debug]Entering 'third_party/neon2sse' 2024-04-03T11:40:42.2080102Z ##[debug]Entering 'third_party/nlohmann' 2024-04-03T11:40:42.2081421Z ##[debug]Entering 'third_party/onnx' 2024-04-03T11:40:42.2082957Z ##[debug]Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:42.2085084Z ##[debug]Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:42.2086760Z ##[debug]Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:42.2088486Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:42.2091038Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:42.2093680Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:42.2096552Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:42.2098769Z ##[debug]Entering 'third_party/pocketfft' 
2024-04-03T11:40:42.2100548Z ##[debug]Entering 'third_party/protobuf' 2024-04-03T11:40:42.2102198Z ##[debug]Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:42.2104177Z ##[debug]Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:42.2105849Z ##[debug]Entering 'third_party/psimd' 2024-04-03T11:40:42.2107195Z ##[debug]Entering 'third_party/pthreadpool' 2024-04-03T11:40:42.2108326Z ##[debug]Entering 'third_party/pybind11' 2024-04-03T11:40:42.2108974Z ##[debug]Entering 'third_party/python-peachpy' 2024-04-03T11:40:42.2109595Z ##[debug]Entering 'third_party/sleef' 2024-04-03T11:40:42.2110150Z ##[debug]Entering 'third_party/tbb' 2024-04-03T11:40:42.2110722Z ##[debug]Entering 'third_party/tensorpipe' 2024-04-03T11:40:42.2111468Z ##[debug]Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:42.2112342Z ##[debug]Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:42.2113184Z ##[debug]Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:42.2114027Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:42.2114981Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T11:40:42.2115796Z ##[debug]Entering 'third_party/zstd' 2024-04-03T11:40:42.2116310Z ##[debug] 2024-04-03T11:40:42.2117664Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2024-04-03T11:40:42.2493338Z Entering 'android/libs/fbjni' 2024-04-03T11:40:42.2567447Z Entering 'third_party/FP16' 2024-04-03T11:40:42.2637581Z Entering 'third_party/FXdiv' 2024-04-03T11:40:42.2703309Z Entering 'third_party/NNPACK' 2024-04-03T11:40:42.2775959Z Entering 'third_party/QNNPACK' 2024-04-03T11:40:42.2833359Z Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:42.2890060Z Entering 'third_party/XNNPACK' 2024-04-03T11:40:42.2971799Z Entering 'third_party/benchmark' 
2024-04-03T11:40:42.3039710Z Entering 'third_party/cpuinfo' 2024-04-03T11:40:42.3108019Z Entering 'third_party/cub' 2024-04-03T11:40:42.3164437Z Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:42.3220936Z Entering 'third_party/cutlass' 2024-04-03T11:40:42.3287697Z Entering 'third_party/eigen' 2024-04-03T11:40:42.3347401Z Entering 'third_party/fbgemm' 2024-04-03T11:40:42.3404542Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:42.3461827Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:42.3515825Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:42.3578555Z Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:42.3633173Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:40:42.3689475Z Entering 'third_party/flatbuffers' 2024-04-03T11:40:42.3760081Z Entering 'third_party/fmt' 2024-04-03T11:40:42.3818618Z Entering 'third_party/foxi' 2024-04-03T11:40:42.3876129Z Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:42.3934075Z Entering 'third_party/gloo' 2024-04-03T11:40:42.3990402Z Entering 'third_party/googletest' 2024-04-03T11:40:42.4047285Z Entering 'third_party/ideep' 2024-04-03T11:40:42.4101714Z Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:42.4169867Z Entering 'third_party/ios-cmake' 2024-04-03T11:40:42.4236233Z Entering 'third_party/ittapi' 2024-04-03T11:40:42.4293379Z Entering 'third_party/kineto' 2024-04-03T11:40:42.4349923Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:42.4407403Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:42.4462771Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:42.4514698Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:42.4567319Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:42.4618612Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:42.4676640Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:42.4729715Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T11:40:42.4781845Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:42.4836308Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:40:42.4891155Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:42.4946101Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:42.5002088Z Entering 'third_party/mimalloc' 2024-04-03T11:40:42.5063847Z Entering 'third_party/nccl/nccl' 2024-04-03T11:40:42.5122606Z Entering 'third_party/neon2sse' 2024-04-03T11:40:42.5182648Z Entering 'third_party/nlohmann' 2024-04-03T11:40:42.5243689Z Entering 'third_party/onnx' 2024-04-03T11:40:42.5328220Z Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:42.5386708Z Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:42.5446845Z Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:42.5506470Z Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:42.5585666Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:42.5649759Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:42.5711741Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:42.5775007Z Entering 'third_party/pocketfft' 2024-04-03T11:40:42.5832664Z Entering 'third_party/protobuf' 2024-04-03T11:40:42.5905739Z Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:42.5981442Z Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:42.6046386Z Entering 'third_party/psimd' 2024-04-03T11:40:42.6118483Z Entering 
'third_party/pthreadpool' 2024-04-03T11:40:42.6178515Z Entering 'third_party/pybind11' 2024-04-03T11:40:42.6234634Z Entering 'third_party/python-peachpy' 2024-04-03T11:40:42.6293781Z Entering 'third_party/sleef' 2024-04-03T11:40:42.6350140Z Entering 'third_party/tbb' 2024-04-03T11:40:42.6408109Z Entering 'third_party/tensorpipe' 2024-04-03T11:40:42.6463903Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:42.6516847Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:42.6567455Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:42.6620145Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:42.6670149Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T11:40:42.6736380Z Entering 'third_party/zstd' 2024-04-03T11:40:42.6812757Z ##[debug]0 2024-04-03T11:40:42.6821748Z ##[debug]Entering 'android/libs/fbjni' 2024-04-03T11:40:42.6823693Z ##[debug]Entering 'third_party/FP16' 2024-04-03T11:40:42.6825564Z ##[debug]Entering 'third_party/FXdiv' 2024-04-03T11:40:42.6827562Z ##[debug]Entering 'third_party/NNPACK' 2024-04-03T11:40:42.6829526Z ##[debug]Entering 'third_party/QNNPACK' 2024-04-03T11:40:42.6831589Z ##[debug]Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T11:40:42.6833868Z ##[debug]Entering 'third_party/XNNPACK' 2024-04-03T11:40:42.6836513Z ##[debug]Entering 'third_party/benchmark' 2024-04-03T11:40:42.6838604Z ##[debug]Entering 'third_party/cpuinfo' 2024-04-03T11:40:42.6840558Z ##[debug]Entering 'third_party/cub' 2024-04-03T11:40:42.6843369Z ##[debug]Entering 'third_party/cudnn_frontend' 2024-04-03T11:40:42.6845568Z ##[debug]Entering 'third_party/cutlass' 2024-04-03T11:40:42.6847555Z ##[debug]Entering 'third_party/eigen' 2024-04-03T11:40:42.6849991Z ##[debug]Entering 'third_party/fbgemm' 2024-04-03T11:40:42.6852339Z ##[debug]Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T11:40:42.6855108Z ##[debug]Entering 
'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T11:40:42.6857935Z ##[debug]Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T11:40:42.6860967Z ##[debug]Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T11:40:42.6863775Z ##[debug]Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T11:40:42.6865914Z ##[debug]Entering 'third_party/flatbuffers' 2024-04-03T11:40:42.6867289Z ##[debug]Entering 'third_party/fmt' 2024-04-03T11:40:42.6868529Z ##[debug]Entering 'third_party/foxi' 2024-04-03T11:40:42.6869947Z ##[debug]Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T11:40:42.6871394Z ##[debug]Entering 'third_party/gloo' 2024-04-03T11:40:42.6872725Z ##[debug]Entering 'third_party/googletest' 2024-04-03T11:40:42.6874089Z ##[debug]Entering 'third_party/ideep' 2024-04-03T11:40:42.6875481Z ##[debug]Entering 'third_party/ideep/mkl-dnn' 2024-04-03T11:40:42.6876891Z ##[debug]Entering 'third_party/ios-cmake' 2024-04-03T11:40:42.6878226Z ##[debug]Entering 'third_party/ittapi' 2024-04-03T11:40:42.6879515Z ##[debug]Entering 'third_party/kineto' 2024-04-03T11:40:42.6881196Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T11:40:42.6883612Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T11:40:42.6886347Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T11:40:42.6890424Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T11:40:42.6894714Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T11:40:42.6898919Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T11:40:42.6901987Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T11:40:42.6904821Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 
2024-04-03T11:40:42.6907622Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T11:40:42.6910392Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T11:40:42.6912747Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T11:40:42.6914863Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T11:40:42.6916692Z ##[debug]Entering 'third_party/mimalloc' 2024-04-03T11:40:42.6918127Z ##[debug]Entering 'third_party/nccl/nccl' 2024-04-03T11:40:42.6920166Z ##[debug]Entering 'third_party/neon2sse' 2024-04-03T11:40:42.6921973Z ##[debug]Entering 'third_party/nlohmann' 2024-04-03T11:40:42.6924002Z ##[debug]Entering 'third_party/onnx' 2024-04-03T11:40:42.6926408Z ##[debug]Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T11:40:42.6928512Z ##[debug]Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T11:40:42.6930352Z ##[debug]Entering 'third_party/onnx-tensorrt' 2024-04-03T11:40:42.6932220Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T11:40:42.6934706Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T11:40:42.6937607Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T11:40:42.6941094Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T11:40:42.6943873Z ##[debug]Entering 'third_party/pocketfft' 2024-04-03T11:40:42.6945385Z ##[debug]Entering 'third_party/protobuf' 2024-04-03T11:40:42.6947135Z ##[debug]Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T11:40:42.6948896Z ##[debug]Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T11:40:42.6950121Z ##[debug]Entering 'third_party/psimd' 2024-04-03T11:40:42.6951204Z ##[debug]Entering 'third_party/pthreadpool' 2024-04-03T11:40:42.6952097Z ##[debug]Entering 
'third_party/pybind11' 2024-04-03T11:40:42.6952927Z ##[debug]Entering 'third_party/python-peachpy' 2024-04-03T11:40:42.6953559Z ##[debug]Entering 'third_party/sleef' 2024-04-03T11:40:42.6954121Z ##[debug]Entering 'third_party/tbb' 2024-04-03T11:40:42.6954690Z ##[debug]Entering 'third_party/tensorpipe' 2024-04-03T11:40:42.6955448Z ##[debug]Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T11:40:42.6956328Z ##[debug]Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T11:40:42.6957189Z ##[debug]Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T11:40:42.6957914Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T11:40:42.6958724Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T11:40:42.6959416Z ##[debug]Entering 'third_party/zstd' 2024-04-03T11:40:42.6959857Z ##[debug] 2024-04-03T11:40:42.6960521Z ::endgroup:: 2024-04-03T11:40:42.6960873Z ##[endgroup] 2024-04-03T11:40:42.6961457Z ##[debug]0 2024-04-03T11:40:42.6962156Z ##[debug]commit d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:40:42.6962766Z ##[debug]Author: Chun Cai 2024-04-03T11:40:42.6963311Z ##[debug]Date: Wed Apr 3 18:10:46 2024 +0800 2024-04-03T11:40:42.6963790Z ##[debug] 2024-04-03T11:40:42.6964086Z ##[debug] Fix UT 2024-04-03T11:40:42.6964414Z ##[debug] 2024-04-03T11:40:42.6964792Z ##[debug] updates the error message in UT 2024-04-03T11:40:42.6965271Z ##[debug] 2024-04-03T11:40:42.6965905Z [command]/usr/bin/git log -1 --format='%H' 2024-04-03T11:40:42.6988507Z 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:40:42.6993064Z ##[debug]0 2024-04-03T11:40:42.6994373Z ##[debug]'d253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:40:42.6995361Z ##[debug] 2024-04-03T11:40:42.7000306Z ##[debug]Unsetting HOME override 2024-04-03T11:40:42.7014626Z ::remove-matcher owner=checkout-git:: 2024-04-03T11:40:42.7063228Z ##[debug]Removed matchers: 'checkout-git' 2024-04-03T11:40:42.7112688Z 
##[debug]Node Action run completed with exit code 0 2024-04-03T11:40:42.7121798Z ##[debug]Save intra-action state isPost = true 2024-04-03T11:40:42.7123468Z ##[debug]Save intra-action state setSafeDirectory = true 2024-04-03T11:40:42.7125711Z ##[debug]Save intra-action state repositoryPath = /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:40:42.7129194Z ##[debug]Finished: run 2024-04-03T11:40:42.7184426Z ##[debug]Finishing: Checkout PyTorch 2024-04-03T11:40:42.7200914Z ##[debug]Evaluating condition for step: 'Setup ROCm' 2024-04-03T11:40:42.7202494Z ##[debug]Evaluating: success() 2024-04-03T11:40:42.7203118Z ##[debug]Evaluating success: 2024-04-03T11:40:42.7203703Z ##[debug]=> true 2024-04-03T11:40:42.7204244Z ##[debug]Result: true 2024-04-03T11:40:42.7204919Z ##[debug]Starting: Setup ROCm 2024-04-03T11:40:42.7292089Z Prepare all required actions 2024-04-03T11:40:42.7292563Z Getting action download info 2024-04-03T11:40:42.7380243Z ##[debug]Register post job cleanup for action: /./.github/actions/setup-rocm 2024-04-03T11:40:42.7396616Z ##[debug]Loading inputs 2024-04-03T11:40:42.7397464Z ##[debug]Loading env 2024-04-03T11:40:42.7403807Z ##[group]Run ./.github/actions/setup-rocm 2024-04-03T11:40:42.7404258Z env: 2024-04-03T11:40:42.7404560Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:42.7404946Z ##[endgroup] 2024-04-03T11:40:42.7412410Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:42.7413555Z ##[debug]Evaluating: success() 2024-04-03T11:40:42.7414179Z ##[debug]Evaluating success: 2024-04-03T11:40:42.7414770Z ##[debug]=> true 2024-04-03T11:40:42.7415317Z ##[debug]Result: true 2024-04-03T11:40:42.7415958Z ##[debug]Starting: run 2024-04-03T11:40:42.7425905Z ##[debug]Loading inputs 2024-04-03T11:40:42.7427098Z ##[debug]Loading env 2024-04-03T11:40:42.7434544Z ##[group]Run echo "DOCKER_HOST=unix:///run/user/$(id -u)/docker.sock" >> "${GITHUB_ENV}" 2024-04-03T11:40:42.7435757Z echo "DOCKER_HOST=unix:///run/user/$(id -u)/docker.sock" >> 
"${GITHUB_ENV}" 2024-04-03T11:40:42.7469824Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:42.7470501Z env: 2024-04-03T11:40:42.7470869Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:42.7471332Z ##[endgroup] 2024-04-03T11:40:42.7499182Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/1ee0296c-c928-4be1-96f4-fb687d045d45.sh 2024-04-03T11:40:42.7556306Z ##[debug]DOCKER_HOST='unix:///run/user/1123/docker.sock' 2024-04-03T11:40:42.7561150Z ##[debug]Finished: run 2024-04-03T11:40:42.7566168Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:42.7568622Z ##[debug]Evaluating: success() 2024-04-03T11:40:42.7569769Z ##[debug]Evaluating success: 2024-04-03T11:40:42.7570925Z ##[debug]=> true 2024-04-03T11:40:42.7572003Z ##[debug]Result: true 2024-04-03T11:40:42.7573285Z ##[debug]Starting: run 2024-04-03T11:40:42.7592580Z ##[debug]Loading inputs 2024-04-03T11:40:42.7594485Z ##[debug]Loading env 2024-04-03T11:40:42.7605290Z ##[group]Run set -ex 2024-04-03T11:40:42.7605991Z set -ex 2024-04-03T11:40:42.7606600Z  2024-04-03T11:40:42.7607283Z cat ~/.docker/config.json || true 2024-04-03T11:40:42.7609555Z # https://stackoverflow.com/questions/64455468/error-when-logging-into-ecr-with-docker-login-error-saving-credentials-not 2024-04-03T11:40:42.7611784Z rm -f ~/.docker/config.json 2024-04-03T11:40:42.7664452Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:42.7665520Z env: 2024-04-03T11:40:42.7666073Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:42.7666914Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:42.7667848Z ##[endgroup] 2024-04-03T11:40:42.7718041Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/f060eeff-12f6-4f26-b6b7-d1508bd2568b.sh 2024-04-03T11:40:42.7750178Z + cat /home/pytorchci/.docker/config.json 2024-04-03T11:40:42.7763246Z { 2024-04-03T11:40:42.7764058Z "auths": {} 
2024-04-03T11:40:42.7766187Z + rm -f /home/pytorchci/.docker/config.json 2024-04-03T11:40:42.7784193Z } 2024-04-03T11:40:42.7796367Z ##[debug]Finished: run 2024-04-03T11:40:42.7803230Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:42.7809775Z ##[debug]Evaluating: always() 2024-04-03T11:40:42.7811291Z ##[debug]Evaluating always: 2024-04-03T11:40:42.7814170Z ##[debug]=> true 2024-04-03T11:40:42.7815814Z ##[debug]Result: true 2024-04-03T11:40:42.7817584Z ##[debug]Starting: run 2024-04-03T11:40:42.7839685Z ##[debug]Loading inputs 2024-04-03T11:40:42.7841712Z ##[debug]Loading env 2024-04-03T11:40:42.7853063Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2024-04-03T11:40:42.7854747Z # ignore expansion of "docker ps -q" since it could be empty 2024-04-03T11:40:42.7856036Z # shellcheck disable=SC2046 2024-04-03T11:40:42.7857031Z docker stop $(docker ps -q) || true 2024-04-03T11:40:42.7858070Z # Prune all stopped containers. 2024-04-03T11:40:42.7859089Z docker container prune -f 2024-04-03T11:40:42.7915113Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:42.7916182Z env: 2024-04-03T11:40:42.7916740Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:42.7917584Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:42.7918514Z ##[endgroup] 2024-04-03T11:40:42.7961627Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/033a7f22-2faa-4280-ae39-f2ba1bc63ac2.sh 2024-04-03T11:40:42.8644125Z "docker stop" requires at least 1 argument. 2024-04-03T11:40:42.8645982Z See 'docker stop --help'. 2024-04-03T11:40:42.8646736Z 2024-04-03T11:40:42.8647424Z Usage: docker stop [OPTIONS] CONTAINER [CONTAINER...] 
2024-04-03T11:40:42.8648631Z 2024-04-03T11:40:42.8649336Z Stop one or more running containers 2024-04-03T11:40:42.8906268Z Total reclaimed space: 0B 2024-04-03T11:40:42.8971586Z ##[debug]Finished: run 2024-04-03T11:40:42.8978770Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:42.8982433Z ##[debug]Evaluating: always() 2024-04-03T11:40:42.8983944Z ##[debug]Evaluating always: 2024-04-03T11:40:42.8985333Z ##[debug]=> true 2024-04-03T11:40:42.8986748Z ##[debug]Result: true 2024-04-03T11:40:42.8989010Z ##[debug]Starting: run 2024-04-03T11:40:42.9022912Z ##[debug]Loading inputs 2024-04-03T11:40:42.9025967Z ##[debug]Loading env 2024-04-03T11:40:42.9043981Z ##[group]Run cat /etc/os-release || true 2024-04-03T11:40:42.9045605Z cat /etc/os-release || true 2024-04-03T11:40:42.9047219Z cat /etc/apt/sources.list.d/rocm.list || true 2024-04-03T11:40:42.9049114Z cat /opt/rocm/.info/version || true 2024-04-03T11:40:42.9050636Z whoami 2024-04-03T11:40:42.9095263Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:42.9095871Z env: 2024-04-03T11:40:42.9096306Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:42.9096849Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:42.9097403Z ##[endgroup] 2024-04-03T11:40:42.9126838Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/2a6f6be3-cc9f-404c-b3b6-0e9251b1a9c0.sh 2024-04-03T11:40:42.9170782Z PRETTY_NAME="Ubuntu 22.04.1 LTS" 2024-04-03T11:40:42.9172697Z NAME="Ubuntu" 2024-04-03T11:40:42.9173931Z VERSION_ID="22.04" 2024-04-03T11:40:42.9175479Z VERSION="22.04.1 LTS (Jammy Jellyfish)" 2024-04-03T11:40:42.9177300Z VERSION_CODENAME=jammy 2024-04-03T11:40:42.9178621Z ID=ubuntu 2024-04-03T11:40:42.9179555Z ID_LIKE=debian 2024-04-03T11:40:42.9180938Z HOME_URL="https://www.ubuntu.com/" 2024-04-03T11:40:42.9182824Z SUPPORT_URL="https://help.ubuntu.com/" 2024-04-03T11:40:42.9184569Z BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" 
2024-04-03T11:40:42.9187652Z PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" 2024-04-03T11:40:42.9190017Z UBUNTU_CODENAME=jammy 2024-04-03T11:40:42.9191683Z deb [arch=amd64] https://repo.radeon.com/rocm/apt/5.6 jammy main 2024-04-03T11:40:42.9205632Z 5.6.0-67 2024-04-03T11:40:42.9241983Z pytorchci 2024-04-03T11:40:42.9255933Z ##[debug]Finished: run 2024-04-03T11:40:42.9264342Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:42.9268240Z ##[debug]Evaluating: always() 2024-04-03T11:40:42.9270257Z ##[debug]Evaluating always: 2024-04-03T11:40:42.9271971Z ##[debug]=> true 2024-04-03T11:40:42.9273848Z ##[debug]Result: true 2024-04-03T11:40:42.9275903Z ##[debug]Starting: run 2024-04-03T11:40:42.9305193Z ##[debug]Loading inputs 2024-04-03T11:40:42.9308869Z ##[debug]Loading env 2024-04-03T11:40:42.9327448Z ##[group]Run rocm-smi 2024-04-03T11:40:42.9328656Z rocm-smi 2024-04-03T11:40:42.9396675Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:42.9397363Z env: 2024-04-03T11:40:42.9397747Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:42.9398248Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:42.9398836Z ##[endgroup] 2024-04-03T11:40:42.9427360Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/74c13f37-8dd5-43d1-a356-6ec938ed461c.sh 2024-04-03T11:40:43.0895455Z 2024-04-03T11:40:43.0895854Z 2024-04-03T11:40:43.0897647Z ========================= ROCm System Management Interface ========================= 2024-04-03T11:40:43.0900305Z =================================== Concise Info =================================== 2024-04-03T11:40:43.0902920Z GPU Temp (DieEdge) AvgPwr SCLK MCLK Fan Perf PwrCap VRAM% GPU% 2024-04-03T11:40:43.0905484Z 0 31.0c 19.0W 938Mhz 350Mhz 14.51% auto 225.0W 0% 0% 2024-04-03T11:40:43.0907793Z 1 30.0c 20.0W 938Mhz 350Mhz 14.51% auto 225.0W 0% 0% 2024-04-03T11:40:43.0910128Z 2 28.0c 16.0W 938Mhz 350Mhz 14.51% auto 225.0W 
0% 0% 2024-04-03T11:40:43.0912575Z 3 27.0c 17.0W 938Mhz 350Mhz 14.51% auto 225.0W 0% 0% 2024-04-03T11:40:43.0914630Z ==================================================================================== 2024-04-03T11:40:43.0917241Z =============================== End of ROCm SMI Log ================================ 2024-04-03T11:40:43.1018746Z ##[debug]Finished: run 2024-04-03T11:40:43.1026210Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:43.1029775Z ##[debug]Evaluating: always() 2024-04-03T11:40:43.1031483Z ##[debug]Evaluating always: 2024-04-03T11:40:43.1033165Z ##[debug]=> true 2024-04-03T11:40:43.1034828Z ##[debug]Result: true 2024-04-03T11:40:43.1036543Z ##[debug]Starting: run 2024-04-03T11:40:43.1062467Z ##[debug]Loading inputs 2024-04-03T11:40:43.1065446Z ##[debug]Loading env 2024-04-03T11:40:43.1081633Z ##[group]Run rocminfo 2024-04-03T11:40:43.1082601Z rocminfo 2024-04-03T11:40:43.1142165Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:43.1143335Z env: 2024-04-03T11:40:43.1144146Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:43.1145177Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:43.1146354Z ##[endgroup] 2024-04-03T11:40:43.1197813Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/579f00f5-bbb0-4a9f-bc94-cd97b3506304.sh 2024-04-03T11:40:43.1952330Z ROCk module is loaded 2024-04-03T11:40:43.1953837Z ===================== 2024-04-03T11:40:43.1955307Z HSA System Attributes 2024-04-03T11:40:43.1956693Z ===================== 2024-04-03T11:40:43.1958352Z Runtime Version: 1.1 2024-04-03T11:40:43.1959944Z System Timestamp Freq.: 1000.000000MHz 2024-04-03T11:40:43.1962308Z Sig. 
Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2024-04-03T11:40:43.1965682Z Machine Model: LARGE 2024-04-03T11:40:43.1967751Z System Endianness: LITTLE 2024-04-03T11:40:43.1969586Z 2024-04-03T11:40:43.1970047Z ========== 2024-04-03T11:40:43.1971362Z HSA Agents 2024-04-03T11:40:43.1973025Z ========== 2024-04-03T11:40:43.1974330Z ******* 2024-04-03T11:40:43.1975744Z Agent 1 2024-04-03T11:40:43.2075454Z ******* 2024-04-03T11:40:43.2076860Z Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:40:43.2078054Z Uuid: CPU-XX 2024-04-03T11:40:43.2079465Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:40:43.2081471Z Vendor Name: CPU 2024-04-03T11:40:43.2082847Z Feature: None specified 2024-04-03T11:40:43.2083908Z Profile: FULL_PROFILE 2024-04-03T11:40:43.2085136Z Float Round Mode: NEAR 2024-04-03T11:40:43.2086342Z Max Queue Number: 0(0x0) 2024-04-03T11:40:43.2087584Z Queue Min Size: 0(0x0) 2024-04-03T11:40:43.2088821Z Queue Max Size: 0(0x0) 2024-04-03T11:40:43.2089922Z Queue Type: MULTI 2024-04-03T11:40:43.2091188Z Node: 0 2024-04-03T11:40:43.2092100Z Device Type: CPU 2024-04-03T11:40:43.2092764Z Cache Info: 2024-04-03T11:40:43.2093312Z L1: 32768(0x8000) KB 2024-04-03T11:40:43.2094043Z Chip ID: 0(0x0) 2024-04-03T11:40:43.2094807Z ASIC Revision: 0(0x0) 2024-04-03T11:40:43.2095578Z Cacheline Size: 64(0x40) 2024-04-03T11:40:43.2096377Z Max Clock Freq. (MHz): 2200 2024-04-03T11:40:43.2097137Z BDFID: 0 2024-04-03T11:40:43.2097888Z Internal Node ID: 0 2024-04-03T11:40:43.2098915Z Compute Unit: 16 2024-04-03T11:40:43.2099816Z SIMDs per CU: 0 2024-04-03T11:40:43.2100602Z Shader Engines: 0 2024-04-03T11:40:43.2101395Z Shader Arrs. per Eng.: 0 2024-04-03T11:40:43.2102225Z WatchPts on Addr. 
Ranges:1 2024-04-03T11:40:43.2102921Z Features: None 2024-04-03T11:40:43.2103413Z Pool Info: 2024-04-03T11:40:43.2103896Z Pool 1 2024-04-03T11:40:43.2104515Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-04-03T11:40:43.2105277Z Size: 131917252(0x7dce5c4) KB 2024-04-03T11:40:43.2106049Z Allocatable: TRUE 2024-04-03T11:40:43.2106853Z Alloc Granule: 4KB 2024-04-03T11:40:43.2107663Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2108487Z Accessible by all: TRUE 2024-04-03T11:40:43.2109170Z Pool 2 2024-04-03T11:40:43.2109788Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-04-03T11:40:43.2110572Z Size: 131917252(0x7dce5c4) KB 2024-04-03T11:40:43.2111348Z Allocatable: TRUE 2024-04-03T11:40:43.2112146Z Alloc Granule: 4KB 2024-04-03T11:40:43.2112947Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2113770Z Accessible by all: TRUE 2024-04-03T11:40:43.2114451Z Pool 3 2024-04-03T11:40:43.2115062Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:40:43.2115849Z Size: 131917252(0x7dce5c4) KB 2024-04-03T11:40:43.2116623Z Allocatable: TRUE 2024-04-03T11:40:43.2117370Z Alloc Granule: 4KB 2024-04-03T11:40:43.2118053Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2118755Z Accessible by all: TRUE 2024-04-03T11:40:43.2119321Z ISA Info: 2024-04-03T11:40:43.2119820Z ******* 2024-04-03T11:40:43.2120206Z Agent 2 2024-04-03T11:40:43.2120570Z ******* 2024-04-03T11:40:43.2121151Z Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:40:43.2121843Z Uuid: CPU-XX 2024-04-03T11:40:43.2122566Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:40:43.2123258Z Vendor Name: CPU 2024-04-03T11:40:43.2123919Z Feature: None specified 2024-04-03T11:40:43.2124575Z Profile: FULL_PROFILE 2024-04-03T11:40:43.2125240Z Float Round Mode: NEAR 2024-04-03T11:40:43.2125920Z Max Queue Number: 0(0x0) 2024-04-03T11:40:43.2126592Z Queue Min Size: 0(0x0) 2024-04-03T11:40:43.2127254Z Queue Max Size: 0(0x0) 2024-04-03T11:40:43.2127917Z Queue Type: MULTI 2024-04-03T11:40:43.2128543Z Node: 1 2024-04-03T11:40:43.2129162Z Device 
Type: CPU 2024-04-03T11:40:43.2129716Z Cache Info: 2024-04-03T11:40:43.2130194Z L1: 32768(0x8000) KB 2024-04-03T11:40:43.2130916Z Chip ID: 0(0x0) 2024-04-03T11:40:43.2131561Z ASIC Revision: 0(0x0) 2024-04-03T11:40:43.2132225Z Cacheline Size: 64(0x40) 2024-04-03T11:40:43.2132894Z Max Clock Freq. (MHz): 2200 2024-04-03T11:40:43.2133542Z BDFID: 0 2024-04-03T11:40:43.2134195Z Internal Node ID: 1 2024-04-03T11:40:43.2134868Z Compute Unit: 16 2024-04-03T11:40:43.2135513Z SIMDs per CU: 0 2024-04-03T11:40:43.2136173Z Shader Engines: 0 2024-04-03T11:40:43.2136855Z Shader Arrs. per Eng.: 0 2024-04-03T11:40:43.2137548Z WatchPts on Addr. Ranges:1 2024-04-03T11:40:43.2138143Z Features: None 2024-04-03T11:40:43.2138568Z Pool Info: 2024-04-03T11:40:43.2138963Z Pool 1 2024-04-03T11:40:43.2139484Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-04-03T11:40:43.2140184Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:40:43.2140929Z Allocatable: TRUE 2024-04-03T11:40:43.2141735Z Alloc Granule: 4KB 2024-04-03T11:40:43.2142550Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2143376Z Accessible by all: TRUE 2024-04-03T11:40:43.2144069Z Pool 2 2024-04-03T11:40:43.2144702Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-04-03T11:40:43.2145491Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:40:43.2146277Z Allocatable: TRUE 2024-04-03T11:40:43.2147093Z Alloc Granule: 4KB 2024-04-03T11:40:43.2147927Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2148750Z Accessible by all: TRUE 2024-04-03T11:40:43.2149437Z Pool 3 2024-04-03T11:40:43.2150066Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:40:43.2155993Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:40:43.2156815Z Allocatable: TRUE 2024-04-03T11:40:43.2157523Z Alloc Granule: 4KB 2024-04-03T11:40:43.2158195Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2158887Z Accessible by all: TRUE 2024-04-03T11:40:43.2159464Z ISA Info: 2024-04-03T11:40:43.2159851Z ******* 2024-04-03T11:40:43.2160231Z Agent 3 2024-04-03T11:40:43.2160600Z ******* 2024-04-03T11:40:43.2161163Z Name: 
AMD EPYC 7601 32-Core Processor 2024-04-03T11:40:43.2161860Z Uuid: CPU-XX 2024-04-03T11:40:43.2162597Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:40:43.2163282Z Vendor Name: CPU 2024-04-03T11:40:43.2163941Z Feature: None specified 2024-04-03T11:40:43.2164591Z Profile: FULL_PROFILE 2024-04-03T11:40:43.2165260Z Float Round Mode: NEAR 2024-04-03T11:40:43.2165935Z Max Queue Number: 0(0x0) 2024-04-03T11:40:43.2166612Z Queue Min Size: 0(0x0) 2024-04-03T11:40:43.2167434Z Queue Max Size: 0(0x0) 2024-04-03T11:40:43.2168086Z Queue Type: MULTI 2024-04-03T11:40:43.2168711Z Node: 2 2024-04-03T11:40:43.2169340Z Device Type: CPU 2024-04-03T11:40:43.2169892Z Cache Info: 2024-04-03T11:40:43.2170364Z L1: 32768(0x8000) KB 2024-04-03T11:40:43.2170985Z Chip ID: 0(0x0) 2024-04-03T11:40:43.2171625Z ASIC Revision: 0(0x0) 2024-04-03T11:40:43.2172292Z Cacheline Size: 64(0x40) 2024-04-03T11:40:43.2172971Z Max Clock Freq. (MHz): 2200 2024-04-03T11:40:43.2173609Z BDFID: 0 2024-04-03T11:40:43.2174251Z Internal Node ID: 2 2024-04-03T11:40:43.2174922Z Compute Unit: 16 2024-04-03T11:40:43.2175575Z SIMDs per CU: 0 2024-04-03T11:40:43.2176231Z Shader Engines: 0 2024-04-03T11:40:43.2176924Z Shader Arrs. per Eng.: 0 2024-04-03T11:40:43.2177638Z WatchPts on Addr. 
Ranges:1 2024-04-03T11:40:43.2178227Z Features: None 2024-04-03T11:40:43.2178654Z Pool Info: 2024-04-03T11:40:43.2179057Z Pool 1 2024-04-03T11:40:43.2179575Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-04-03T11:40:43.2180478Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:40:43.2181261Z Allocatable: TRUE 2024-04-03T11:40:43.2182076Z Alloc Granule: 4KB 2024-04-03T11:40:43.2182912Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2183758Z Accessible by all: TRUE 2024-04-03T11:40:43.2184450Z Pool 2 2024-04-03T11:40:43.2185086Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-04-03T11:40:43.2185890Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:40:43.2186803Z Allocatable: TRUE 2024-04-03T11:40:43.2187617Z Alloc Granule: 4KB 2024-04-03T11:40:43.2188455Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2189292Z Accessible by all: TRUE 2024-04-03T11:40:43.2189973Z Pool 3 2024-04-03T11:40:43.2190604Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:40:43.2191411Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:40:43.2192173Z Allocatable: TRUE 2024-04-03T11:40:43.2192983Z Alloc Granule: 4KB 2024-04-03T11:40:43.2193802Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2194619Z Accessible by all: TRUE 2024-04-03T11:40:43.2195316Z ISA Info: 2024-04-03T11:40:43.2195785Z ******* 2024-04-03T11:40:43.2196224Z Agent 4 2024-04-03T11:40:43.2196675Z ******* 2024-04-03T11:40:43.2197303Z Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:40:43.2198077Z Uuid: CPU-XX 2024-04-03T11:40:43.2199051Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:40:43.2199888Z Vendor Name: CPU 2024-04-03T11:40:43.2200973Z Feature: None specified 2024-04-03T11:40:43.2201802Z Profile: FULL_PROFILE 2024-04-03T11:40:43.2202610Z Float Round Mode: NEAR 2024-04-03T11:40:43.2203439Z Max Queue Number: 0(0x0) 2024-04-03T11:40:43.2204234Z Queue Min Size: 0(0x0) 2024-04-03T11:40:43.2205048Z Queue Max Size: 0(0x0) 2024-04-03T11:40:43.2205843Z Queue Type: MULTI 2024-04-03T11:40:43.2206590Z Node: 3 2024-04-03T11:40:43.2207322Z Device 
Type: CPU 2024-04-03T11:40:43.2207885Z Cache Info: 2024-04-03T11:40:43.2208446Z L1: 32768(0x8000) KB 2024-04-03T11:40:43.2209184Z Chip ID: 0(0x0) 2024-04-03T11:40:43.2209952Z ASIC Revision: 0(0x0) 2024-04-03T11:40:43.2210743Z Cacheline Size: 64(0x40) 2024-04-03T11:40:43.2211555Z Max Clock Freq. (MHz): 2200 2024-04-03T11:40:43.2212334Z BDFID: 0 2024-04-03T11:40:43.2213105Z Internal Node ID: 3 2024-04-03T11:40:43.2213913Z Compute Unit: 16 2024-04-03T11:40:43.2214700Z SIMDs per CU: 0 2024-04-03T11:40:43.2215493Z Shader Engines: 0 2024-04-03T11:40:43.2216324Z Shader Arrs. per Eng.: 0 2024-04-03T11:40:43.2217115Z WatchPts on Addr. Ranges:1 2024-04-03T11:40:43.2217715Z Features: None 2024-04-03T11:40:43.2218138Z Pool Info: 2024-04-03T11:40:43.2218548Z Pool 1 2024-04-03T11:40:43.2219071Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-04-03T11:40:43.2219810Z Size: 132106664(0x7dfc9a8) KB 2024-04-03T11:40:43.2220498Z Allocatable: TRUE 2024-04-03T11:40:43.2221454Z Alloc Granule: 4KB 2024-04-03T11:40:43.2222274Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2223117Z Accessible by all: TRUE 2024-04-03T11:40:43.2223813Z Pool 2 2024-04-03T11:40:43.2224441Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-04-03T11:40:43.2225237Z Size: 132106664(0x7dfc9a8) KB 2024-04-03T11:40:43.2226021Z Allocatable: TRUE 2024-04-03T11:40:43.2226841Z Alloc Granule: 4KB 2024-04-03T11:40:43.2227661Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2228509Z Accessible by all: TRUE 2024-04-03T11:40:43.2229200Z Pool 3 2024-04-03T11:40:43.2229828Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:40:43.2230648Z Size: 132106664(0x7dfc9a8) KB 2024-04-03T11:40:43.2231427Z Allocatable: TRUE 2024-04-03T11:40:43.2232240Z Alloc Granule: 4KB 2024-04-03T11:40:43.2233058Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2233898Z Accessible by all: TRUE 2024-04-03T11:40:43.2234584Z ISA Info: 2024-04-03T11:40:43.2235187Z ******* 2024-04-03T11:40:43.2235635Z Agent 5 2024-04-03T11:40:43.2236079Z ******* 2024-04-03T11:40:43.2236629Z Name: 
gfx906 2024-04-03T11:40:43.2237377Z Uuid: GPU-c406418172e62692 2024-04-03T11:40:43.2238037Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:40:43.2238727Z Vendor Name: AMD 2024-04-03T11:40:43.2239381Z Feature: KERNEL_DISPATCH 2024-04-03T11:40:43.2240024Z Profile: BASE_PROFILE 2024-04-03T11:40:43.2240693Z Float Round Mode: NEAR 2024-04-03T11:40:43.2241378Z Max Queue Number: 128(0x80) 2024-04-03T11:40:43.2242045Z Queue Min Size: 64(0x40) 2024-04-03T11:40:43.2242719Z Queue Max Size: 131072(0x20000) 2024-04-03T11:40:43.2243381Z Queue Type: MULTI 2024-04-03T11:40:43.2244007Z Node: 4 2024-04-03T11:40:43.2244631Z Device Type: GPU 2024-04-03T11:40:43.2245197Z Cache Info: 2024-04-03T11:40:43.2245673Z L1: 16(0x10) KB 2024-04-03T11:40:43.2246266Z L2: 8192(0x2000) KB 2024-04-03T11:40:43.2246891Z Chip ID: 26273(0x66a1) 2024-04-03T11:40:43.2247543Z ASIC Revision: 1(0x1) 2024-04-03T11:40:43.2248202Z Cacheline Size: 64(0x40) 2024-04-03T11:40:43.2248877Z Max Clock Freq. (MHz): 1800 2024-04-03T11:40:43.2249526Z BDFID: 8960 2024-04-03T11:40:43.2250165Z Internal Node ID: 4 2024-04-03T11:40:43.2250832Z Compute Unit: 64 2024-04-03T11:40:43.2251492Z SIMDs per CU: 4 2024-04-03T11:40:43.2252147Z Shader Engines: 4 2024-04-03T11:40:43.2252829Z Shader Arrs. per Eng.: 1 2024-04-03T11:40:43.2253633Z WatchPts on Addr. 
Ranges:4 2024-04-03T11:40:43.2254261Z Features: KERNEL_DISPATCH 2024-04-03T11:40:43.2254869Z Fast F16 Operation: TRUE 2024-04-03T11:40:43.2255550Z Wavefront Size: 64(0x40) 2024-04-03T11:40:43.2256225Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:40:43.2256815Z Workgroup Max Size per Dimension: 2024-04-03T11:40:43.2257361Z x 1024(0x400) 2024-04-03T11:40:43.2257941Z y 1024(0x400) 2024-04-03T11:40:43.2258504Z z 1024(0x400) 2024-04-03T11:40:43.2259144Z Max Waves Per CU: 40(0x28) 2024-04-03T11:40:43.2260096Z Max Work-item Per CU: 2560(0xa00) 2024-04-03T11:40:43.2260871Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:40:43.2261580Z Grid Max Size per Dimension: 2024-04-03T11:40:43.2262180Z x 4294967295(0xffffffff) 2024-04-03T11:40:43.2262877Z y 4294967295(0xffffffff) 2024-04-03T11:40:43.2263596Z z 4294967295(0xffffffff) 2024-04-03T11:40:43.2264379Z Max fbarriers/Workgrp: 32 2024-04-03T11:40:43.2265075Z Pool Info: 2024-04-03T11:40:43.2265716Z Pool 1 2024-04-03T11:40:43.2266357Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:40:43.2267153Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:40:43.2267924Z Allocatable: TRUE 2024-04-03T11:40:43.2268739Z Alloc Granule: 4KB 2024-04-03T11:40:43.2269561Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2270392Z Accessible by all: FALSE 2024-04-03T11:40:43.2271091Z Pool 2 2024-04-03T11:40:43.2271695Z Segment: GROUP 2024-04-03T11:40:43.2272452Z Size: 64(0x40) KB 2024-04-03T11:40:43.2273226Z Allocatable: FALSE 2024-04-03T11:40:43.2274031Z Alloc Granule: 0KB 2024-04-03T11:40:43.2274855Z Alloc Alignment: 0KB 2024-04-03T11:40:43.2275693Z Accessible by all: FALSE 2024-04-03T11:40:43.2276380Z ISA Info: 2024-04-03T11:40:43.2276850Z ISA 1 2024-04-03T11:40:43.2277558Z Name: amdgcn-amd-amdhsa--gfx906:sramecc+:xnack- 2024-04-03T11:40:43.2278278Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-04-03T11:40:43.2278969Z Profiles: HSA_PROFILE_BASE 2024-04-03T11:40:43.2279666Z Default Rounding Mode: NEAR 2024-04-03T11:40:43.2280383Z Default Rounding Mode: 
NEAR 2024-04-03T11:40:43.2281065Z Fast f16: TRUE 2024-04-03T11:40:43.2281733Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:40:43.2282354Z Workgroup Max Size per Dimension: 2024-04-03T11:40:43.2282896Z x 1024(0x400) 2024-04-03T11:40:43.2283476Z y 1024(0x400) 2024-04-03T11:40:43.2284061Z z 1024(0x400) 2024-04-03T11:40:43.2284711Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:40:43.2285413Z Grid Max Size per Dimension: 2024-04-03T11:40:43.2285941Z x 4294967295(0xffffffff) 2024-04-03T11:40:43.2286544Z y 4294967295(0xffffffff) 2024-04-03T11:40:43.2287136Z z 4294967295(0xffffffff) 2024-04-03T11:40:43.2287788Z FBarrier Max Size: 32 2024-04-03T11:40:43.2288356Z ******* 2024-04-03T11:40:43.2288724Z Agent 6 2024-04-03T11:40:43.2289108Z ******* 2024-04-03T11:40:43.2289568Z Name: gfx906 2024-04-03T11:40:43.2290248Z Uuid: GPU-3014910172e62698 2024-04-03T11:40:43.2290914Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:40:43.2291592Z Vendor Name: AMD 2024-04-03T11:40:43.2292247Z Feature: KERNEL_DISPATCH 2024-04-03T11:40:43.2292897Z Profile: BASE_PROFILE 2024-04-03T11:40:43.2293564Z Float Round Mode: NEAR 2024-04-03T11:40:43.2294258Z Max Queue Number: 128(0x80) 2024-04-03T11:40:43.2294925Z Queue Min Size: 64(0x40) 2024-04-03T11:40:43.2295591Z Queue Max Size: 131072(0x20000) 2024-04-03T11:40:43.2296251Z Queue Type: MULTI 2024-04-03T11:40:43.2296997Z Node: 5 2024-04-03T11:40:43.2297632Z Device Type: GPU 2024-04-03T11:40:43.2298190Z Cache Info: 2024-04-03T11:40:43.2298657Z L1: 16(0x10) KB 2024-04-03T11:40:43.2299254Z L2: 8192(0x2000) KB 2024-04-03T11:40:43.2299986Z Chip ID: 26273(0x66a1) 2024-04-03T11:40:43.2300684Z ASIC Revision: 1(0x1) 2024-04-03T11:40:43.2301468Z Cacheline Size: 64(0x40) 2024-04-03T11:40:43.2302267Z Max Clock Freq. 
(MHz): 1800 2024-04-03T11:40:43.2303024Z BDFID: 9728 2024-04-03T11:40:43.2303776Z Internal Node ID: 5 2024-04-03T11:40:43.2304583Z Compute Unit: 64 2024-04-03T11:40:43.2305354Z SIMDs per CU: 4 2024-04-03T11:40:43.2306129Z Shader Engines: 4 2024-04-03T11:40:43.2306934Z Shader Arrs. per Eng.: 1 2024-04-03T11:40:43.2307767Z WatchPts on Addr. Ranges:4 2024-04-03T11:40:43.2308491Z Features: KERNEL_DISPATCH 2024-04-03T11:40:43.2309215Z Fast F16 Operation: TRUE 2024-04-03T11:40:43.2310018Z Wavefront Size: 64(0x40) 2024-04-03T11:40:43.2310813Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:40:43.2311524Z Workgroup Max Size per Dimension: 2024-04-03T11:40:43.2312145Z x 1024(0x400) 2024-04-03T11:40:43.2312817Z y 1024(0x400) 2024-04-03T11:40:43.2313513Z z 1024(0x400) 2024-04-03T11:40:43.2314267Z Max Waves Per CU: 40(0x28) 2024-04-03T11:40:43.2315187Z Max Work-item Per CU: 2560(0xa00) 2024-04-03T11:40:43.2316004Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:40:43.2316696Z Grid Max Size per Dimension: 2024-04-03T11:40:43.2317423Z x 4294967295(0xffffffff) 2024-04-03T11:40:43.2318015Z y 4294967295(0xffffffff) 2024-04-03T11:40:43.2318608Z z 4294967295(0xffffffff) 2024-04-03T11:40:43.2319265Z Max fbarriers/Workgrp: 32 2024-04-03T11:40:43.2319842Z Pool Info: 2024-04-03T11:40:43.2320246Z Pool 1 2024-04-03T11:40:43.2320777Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:40:43.2321433Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:40:43.2322080Z Allocatable: TRUE 2024-04-03T11:40:43.2322761Z Alloc Granule: 4KB 2024-04-03T11:40:43.2323434Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2324128Z Accessible by all: FALSE 2024-04-03T11:40:43.2324724Z Pool 2 2024-04-03T11:40:43.2325222Z Segment: GROUP 2024-04-03T11:40:43.2325855Z Size: 64(0x40) KB 2024-04-03T11:40:43.2326500Z Allocatable: FALSE 2024-04-03T11:40:43.2327177Z Alloc Granule: 0KB 2024-04-03T11:40:43.2327851Z Alloc Alignment: 0KB 2024-04-03T11:40:43.2328654Z Accessible by all: FALSE 2024-04-03T11:40:43.2329236Z ISA Info: 
2024-04-03T11:40:43.2329670Z ISA 1 2024-04-03T11:40:43.2330405Z Name: amdgcn-amd-amdhsa--gfx906:sramecc+:xnack- 2024-04-03T11:40:43.2331253Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-04-03T11:40:43.2332069Z Profiles: HSA_PROFILE_BASE 2024-04-03T11:40:43.2332893Z Default Rounding Mode: NEAR 2024-04-03T11:40:43.2333740Z Default Rounding Mode: NEAR 2024-04-03T11:40:43.2334535Z Fast f16: TRUE 2024-04-03T11:40:43.2335329Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:40:43.2336058Z Workgroup Max Size per Dimension: 2024-04-03T11:40:43.2336702Z x 1024(0x400) 2024-04-03T11:40:43.2337397Z y 1024(0x400) 2024-04-03T11:40:43.2338093Z z 1024(0x400) 2024-04-03T11:40:43.2338857Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:40:43.2339553Z Grid Max Size per Dimension: 2024-04-03T11:40:43.2340361Z x 4294967295(0xffffffff) 2024-04-03T11:40:43.2341089Z y 4294967295(0xffffffff) 2024-04-03T11:40:43.2341790Z z 4294967295(0xffffffff) 2024-04-03T11:40:43.2342558Z FBarrier Max Size: 32 2024-04-03T11:40:43.2343226Z ******* 2024-04-03T11:40:43.2343662Z Agent 7 2024-04-03T11:40:43.2344108Z ******* 2024-04-03T11:40:43.2344646Z Name: gfx906 2024-04-03T11:40:43.2345468Z Uuid: GPU-72a690c172edb148 2024-04-03T11:40:43.2346262Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:40:43.2347074Z Vendor Name: AMD 2024-04-03T11:40:43.2347838Z Feature: KERNEL_DISPATCH 2024-04-03T11:40:43.2348609Z Profile: BASE_PROFILE 2024-04-03T11:40:43.2349557Z Float Round Mode: NEAR 2024-04-03T11:40:43.2350382Z Max Queue Number: 128(0x80) 2024-04-03T11:40:43.2351202Z Queue Min Size: 64(0x40) 2024-04-03T11:40:43.2352004Z Queue Max Size: 131072(0x20000) 2024-04-03T11:40:43.2352806Z Queue Type: MULTI 2024-04-03T11:40:43.2353556Z Node: 6 2024-04-03T11:40:43.2354312Z Device Type: GPU 2024-04-03T11:40:43.2354980Z Cache Info: 2024-04-03T11:40:43.2355543Z L1: 16(0x10) KB 2024-04-03T11:40:43.2356258Z L2: 8192(0x2000) KB 2024-04-03T11:40:43.2357002Z Chip ID: 26273(0x66a1) 2024-04-03T11:40:43.2357711Z ASIC Revision: 
1(0x1) 2024-04-03T11:40:43.2358382Z Cacheline Size: 64(0x40) 2024-04-03T11:40:43.2359060Z Max Clock Freq. (MHz): 1800 2024-04-03T11:40:43.2359698Z BDFID: 25344 2024-04-03T11:40:43.2360351Z Internal Node ID: 6 2024-04-03T11:40:43.2361017Z Compute Unit: 64 2024-04-03T11:40:43.2361768Z SIMDs per CU: 4 2024-04-03T11:40:43.2362436Z Shader Engines: 4 2024-04-03T11:40:43.2363121Z Shader Arrs. per Eng.: 1 2024-04-03T11:40:43.2363828Z WatchPts on Addr. Ranges:4 2024-04-03T11:40:43.2364442Z Features: KERNEL_DISPATCH 2024-04-03T11:40:43.2365056Z Fast F16 Operation: TRUE 2024-04-03T11:40:43.2365747Z Wavefront Size: 64(0x40) 2024-04-03T11:40:43.2366422Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:40:43.2367023Z Workgroup Max Size per Dimension: 2024-04-03T11:40:43.2367548Z x 1024(0x400) 2024-04-03T11:40:43.2368117Z y 1024(0x400) 2024-04-03T11:40:43.2368690Z z 1024(0x400) 2024-04-03T11:40:43.2369336Z Max Waves Per CU: 40(0x28) 2024-04-03T11:40:43.2370105Z Max Work-item Per CU: 2560(0xa00) 2024-04-03T11:40:43.2370851Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:40:43.2371441Z Grid Max Size per Dimension: 2024-04-03T11:40:43.2371928Z x 4294967295(0xffffffff) 2024-04-03T11:40:43.2372523Z y 4294967295(0xffffffff) 2024-04-03T11:40:43.2373117Z z 4294967295(0xffffffff) 2024-04-03T11:40:43.2373770Z Max fbarriers/Workgrp: 32 2024-04-03T11:40:43.2374353Z Pool Info: 2024-04-03T11:40:43.2374756Z Pool 1 2024-04-03T11:40:43.2375285Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:40:43.2375935Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:40:43.2376588Z Allocatable: TRUE 2024-04-03T11:40:43.2377266Z Alloc Granule: 4KB 2024-04-03T11:40:43.2377938Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2378639Z Accessible by all: FALSE 2024-04-03T11:40:43.2379222Z Pool 2 2024-04-03T11:40:43.2379902Z Segment: GROUP 2024-04-03T11:40:43.2380589Z Size: 64(0x40) KB 2024-04-03T11:40:43.2381353Z Allocatable: FALSE 2024-04-03T11:40:43.2382144Z Alloc Granule: 0KB 2024-04-03T11:40:43.2382957Z Alloc 
Alignment: 0KB 2024-04-03T11:40:43.2383795Z Accessible by all: FALSE 2024-04-03T11:40:43.2384492Z ISA Info: 2024-04-03T11:40:43.2384971Z ISA 1 2024-04-03T11:40:43.2385715Z Name: amdgcn-amd-amdhsa--gfx906:sramecc+:xnack- 2024-04-03T11:40:43.2386574Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-04-03T11:40:43.2387403Z Profiles: HSA_PROFILE_BASE 2024-04-03T11:40:43.2388234Z Default Rounding Mode: NEAR 2024-04-03T11:40:43.2389107Z Default Rounding Mode: NEAR 2024-04-03T11:40:43.2389913Z Fast f16: TRUE 2024-04-03T11:40:43.2390744Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:40:43.2391479Z Workgroup Max Size per Dimension: 2024-04-03T11:40:43.2392126Z x 1024(0x400) 2024-04-03T11:40:43.2392842Z y 1024(0x400) 2024-04-03T11:40:43.2393696Z z 1024(0x400) 2024-04-03T11:40:43.2394461Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:40:43.2395181Z Grid Max Size per Dimension: 2024-04-03T11:40:43.2395814Z x 4294967295(0xffffffff) 2024-04-03T11:40:43.2396537Z y 4294967295(0xffffffff) 2024-04-03T11:40:43.2397269Z z 4294967295(0xffffffff) 2024-04-03T11:40:43.2397952Z FBarrier Max Size: 32 2024-04-03T11:40:43.2398522Z ******* 2024-04-03T11:40:43.2398890Z Agent 8 2024-04-03T11:40:43.2399269Z ******* 2024-04-03T11:40:43.2399727Z Name: gfx906 2024-04-03T11:40:43.2400413Z Uuid: GPU-301890c172e62698 2024-04-03T11:40:43.2401088Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:40:43.2401769Z Vendor Name: AMD 2024-04-03T11:40:43.2402413Z Feature: KERNEL_DISPATCH 2024-04-03T11:40:43.2403067Z Profile: BASE_PROFILE 2024-04-03T11:40:43.2403736Z Float Round Mode: NEAR 2024-04-03T11:40:43.2404418Z Max Queue Number: 128(0x80) 2024-04-03T11:40:43.2405093Z Queue Min Size: 64(0x40) 2024-04-03T11:40:43.2405764Z Queue Max Size: 131072(0x20000) 2024-04-03T11:40:43.2406418Z Queue Type: MULTI 2024-04-03T11:40:43.2407049Z Node: 7 2024-04-03T11:40:43.2407683Z Device Type: GPU 2024-04-03T11:40:43.2408242Z Cache Info: 2024-04-03T11:40:43.2408720Z L1: 16(0x10) KB 2024-04-03T11:40:43.2409324Z L2: 
8192(0x2000) KB 2024-04-03T11:40:43.2409938Z Chip ID: 26273(0x66a1) 2024-04-03T11:40:43.2410597Z ASIC Revision: 1(0x1) 2024-04-03T11:40:43.2411378Z Cacheline Size: 64(0x40) 2024-04-03T11:40:43.2412065Z Max Clock Freq. (MHz): 1800 2024-04-03T11:40:43.2412704Z BDFID: 26112 2024-04-03T11:40:43.2413354Z Internal Node ID: 7 2024-04-03T11:40:43.2414025Z Compute Unit: 64 2024-04-03T11:40:43.2414681Z SIMDs per CU: 4 2024-04-03T11:40:43.2415461Z Shader Engines: 4 2024-04-03T11:40:43.2416271Z Shader Arrs. per Eng.: 1 2024-04-03T11:40:43.2417101Z WatchPts on Addr. Ranges:4 2024-04-03T11:40:43.2417837Z Features: KERNEL_DISPATCH 2024-04-03T11:40:43.2418562Z Fast F16 Operation: TRUE 2024-04-03T11:40:43.2419355Z Wavefront Size: 64(0x40) 2024-04-03T11:40:43.2420260Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:40:43.2420984Z Workgroup Max Size per Dimension: 2024-04-03T11:40:43.2421609Z x 1024(0x400) 2024-04-03T11:40:43.2422308Z y 1024(0x400) 2024-04-03T11:40:43.2423009Z z 1024(0x400) 2024-04-03T11:40:43.2423775Z Max Waves Per CU: 40(0x28) 2024-04-03T11:40:43.2424841Z Max Work-item Per CU: 2560(0xa00) 2024-04-03T11:40:43.2425686Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:40:43.2426404Z Grid Max Size per Dimension: 2024-04-03T11:40:43.2426995Z x 4294967295(0xffffffff) 2024-04-03T11:40:43.2427716Z y 4294967295(0xffffffff) 2024-04-03T11:40:43.2428447Z z 4294967295(0xffffffff) 2024-04-03T11:40:43.2429238Z Max fbarriers/Workgrp: 32 2024-04-03T11:40:43.2429959Z Pool Info: 2024-04-03T11:40:43.2430451Z Pool 1 2024-04-03T11:40:43.2431081Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:40:43.2431891Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:40:43.2432692Z Allocatable: TRUE 2024-04-03T11:40:43.2433508Z Alloc Granule: 4KB 2024-04-03T11:40:43.2434349Z Alloc Alignment: 4KB 2024-04-03T11:40:43.2435210Z Accessible by all: FALSE 2024-04-03T11:40:43.2435910Z Pool 2 2024-04-03T11:40:43.2436516Z Segment: GROUP 2024-04-03T11:40:43.2437303Z Size: 64(0x40) KB 
2024-04-03T11:40:43.2438017Z Allocatable: FALSE 2024-04-03T11:40:43.2438686Z Alloc Granule: 0KB 2024-04-03T11:40:43.2439375Z Alloc Alignment: 0KB 2024-04-03T11:40:43.2440091Z Accessible by all: FALSE 2024-04-03T11:40:43.2440666Z ISA Info: 2024-04-03T11:40:43.2441075Z ISA 1 2024-04-03T11:40:43.2441697Z Name: amdgcn-amd-amdhsa--gfx906:sramecc+:xnack- 2024-04-03T11:40:43.2442411Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-04-03T11:40:43.2443126Z Profiles: HSA_PROFILE_BASE 2024-04-03T11:40:43.2443831Z Default Rounding Mode: NEAR 2024-04-03T11:40:43.2444555Z Default Rounding Mode: NEAR 2024-04-03T11:40:43.2445350Z Fast f16: TRUE 2024-04-03T11:40:43.2446045Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:40:43.2446662Z Workgroup Max Size per Dimension: 2024-04-03T11:40:43.2447214Z x 1024(0x400) 2024-04-03T11:40:43.2447809Z y 1024(0x400) 2024-04-03T11:40:43.2448403Z z 1024(0x400) 2024-04-03T11:40:43.2449062Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:40:43.2449665Z Grid Max Size per Dimension: 2024-04-03T11:40:43.2450195Z x 4294967295(0xffffffff) 2024-04-03T11:40:43.2450799Z y 4294967295(0xffffffff) 2024-04-03T11:40:43.2451412Z z 4294967295(0xffffffff) 2024-04-03T11:40:43.2452082Z FBarrier Max Size: 32 2024-04-03T11:40:43.2452653Z *** Done *** 2024-04-03T11:40:43.2456545Z ##[debug]Finished: run 2024-04-03T11:40:43.2459379Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:43.2461444Z ##[debug]Evaluating: always() 2024-04-03T11:40:43.2462140Z ##[debug]Evaluating always: 2024-04-03T11:40:43.2462766Z ##[debug]=> true 2024-04-03T11:40:43.2463427Z ##[debug]Result: true 2024-04-03T11:40:43.2464361Z ##[debug]Starting: run 2024-04-03T11:40:43.2477128Z ##[debug]Loading inputs 2024-04-03T11:40:43.2478251Z ##[debug]Loading env 2024-04-03T11:40:43.2484266Z ##[group]Run ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2024-04-03T11:40:43.2484962Z ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2024-04-03T11:40:43.2486206Z msg="Please file an issue on pytorch/pytorch 
reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T11:40:43.2487380Z if [[ $ngpu -eq 0 ]]; then 2024-04-03T11:40:43.2487992Z  echo "Error: Failed to detect any GPUs on the runner" 2024-04-03T11:40:43.2488621Z  echo "$msg" 2024-04-03T11:40:43.2489003Z  exit 1 2024-04-03T11:40:43.2489341Z fi 2024-04-03T11:40:43.2489685Z if [[ $ngpu -eq 1 ]]; then 2024-04-03T11:40:43.2490476Z  echo "Error: only 1 GPU detected, at least 2 GPUs are needed for distributed jobs" 2024-04-03T11:40:43.2491273Z  echo "$msg" 2024-04-03T11:40:43.2491660Z  exit 1 2024-04-03T11:40:43.2492006Z fi 2024-04-03T11:40:43.2526088Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:43.2526653Z env: 2024-04-03T11:40:43.2526966Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:43.2527441Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:43.2527949Z ##[endgroup] 2024-04-03T11:40:43.2557524Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/7a46aaaf-404f-494a-a83e-74e3d6a3394e.sh 2024-04-03T11:40:43.3496465Z ##[debug]Finished: run 2024-04-03T11:40:43.3503126Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:43.3506300Z ##[debug]Evaluating: always() 2024-04-03T11:40:43.3507792Z ##[debug]Evaluating always: 2024-04-03T11:40:43.3509173Z ##[debug]=> true 2024-04-03T11:40:43.3510587Z ##[debug]Result: true 2024-04-03T11:40:43.3512249Z ##[debug]Starting: run 2024-04-03T11:40:43.3577400Z Prepare all required actions 2024-04-03T11:40:43.3620082Z ##[debug]Loading inputs 2024-04-03T11:40:43.3622008Z ##[debug]Loading env 2024-04-03T11:40:43.3628939Z ##[group]Run ./.github/actions/diskspace-cleanup 2024-04-03T11:40:43.3629542Z with: 2024-04-03T11:40:43.3629871Z diskspace-cutoff: 70 2024-04-03T11:40:43.3630216Z env: 2024-04-03T11:40:43.3630515Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:43.3630976Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:43.3631479Z 
##[endgroup] 2024-04-03T11:40:43.3633672Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:43.3634837Z ##[debug]Evaluating: success() 2024-04-03T11:40:43.3635432Z ##[debug]Evaluating success: 2024-04-03T11:40:43.3636023Z ##[debug]=> true 2024-04-03T11:40:43.3636572Z ##[debug]Result: true 2024-04-03T11:40:43.3637206Z ##[debug]Starting: run 2024-04-03T11:40:43.3646125Z ##[debug]Loading inputs 2024-04-03T11:40:43.3649561Z ##[debug]Evaluating: format('diskspace_cutoff={0} 2024-04-03T11:40:43.3650352Z ##[debug]diskspace=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T11:40:43.3651762Z ##[debug]msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T11:40:43.3653014Z ##[debug]if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3653602Z ##[debug] docker system prune -af 2024-04-03T11:40:43.3654353Z ##[debug] diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T11:40:43.3655258Z ##[debug] if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3656257Z ##[debug] echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-04-03T11:40:43.3657122Z ##[debug] echo "$msg" 2024-04-03T11:40:43.3657523Z ##[debug] exit 1 2024-04-03T11:40:43.3657877Z ##[debug] else 2024-04-03T11:40:43.3658320Z ##[debug] difference=$((diskspace - diskspace_new)) 2024-04-03T11:40:43.3659200Z ##[debug] echo "Diskspace saved: $difference percent" 2024-04-03T11:40:43.3659925Z ##[debug] fi 2024-04-03T11:40:43.3660242Z ##[debug]fi 2024-04-03T11:40:43.3660646Z ##[debug]', inputs.diskspace-cutoff) 2024-04-03T11:40:43.3661249Z ##[debug]Evaluating format: 2024-04-03T11:40:43.3681476Z ##[debug]..Evaluating String: 2024-04-03T11:40:43.3682020Z ##[debug]..=> 'diskspace_cutoff={0} 2024-04-03T11:40:43.3682744Z ##[debug]diskspace=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T11:40:43.3684175Z ##[debug]msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T11:40:43.3685422Z ##[debug]if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3686028Z ##[debug] docker system prune -af 2024-04-03T11:40:43.3686786Z ##[debug] diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T11:40:43.3687689Z ##[debug] if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3688698Z ##[debug] echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-04-03T11:40:43.3689561Z ##[debug] echo "$msg" 2024-04-03T11:40:43.3689959Z ##[debug] exit 1 2024-04-03T11:40:43.3690311Z ##[debug] else 2024-04-03T11:40:43.3690749Z ##[debug] difference=$((diskspace - diskspace_new)) 2024-04-03T11:40:43.3691409Z ##[debug] echo "Diskspace saved: $difference percent" 2024-04-03T11:40:43.3691961Z ##[debug] fi 2024-04-03T11:40:43.3692288Z ##[debug]fi 2024-04-03T11:40:43.3692579Z ##[debug]' 2024-04-03T11:40:43.3719421Z ##[debug]..Evaluating Index: 2024-04-03T11:40:43.3719876Z ##[debug]....Evaluating inputs: 2024-04-03T11:40:43.3720334Z ##[debug]....=> Object 2024-04-03T11:40:43.3720764Z ##[debug]....Evaluating String: 2024-04-03T11:40:43.3721229Z ##[debug]....=> 'diskspace-cutoff' 2024-04-03T11:40:43.3721693Z ##[debug]..=> '70' 2024-04-03T11:40:43.3723099Z ##[debug]=> 'diskspace_cutoff=70 2024-04-03T11:40:43.3723806Z ##[debug]diskspace=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T11:40:43.3725213Z ##[debug]msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T11:40:43.3726469Z ##[debug]if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3727057Z ##[debug] docker system prune -af 2024-04-03T11:40:43.3728040Z ##[debug] diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T11:40:43.3728956Z ##[debug] if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3729958Z ##[debug] echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-04-03T11:40:43.3730828Z ##[debug] echo "$msg" 2024-04-03T11:40:43.3731227Z ##[debug] exit 1 2024-04-03T11:40:43.3731587Z ##[debug] else 2024-04-03T11:40:43.3732045Z ##[debug] difference=$((diskspace - diskspace_new)) 2024-04-03T11:40:43.3732717Z ##[debug] echo "Diskspace saved: $difference percent" 2024-04-03T11:40:43.3733275Z ##[debug] fi 2024-04-03T11:40:43.3733585Z ##[debug]fi 2024-04-03T11:40:43.3733877Z ##[debug]' 2024-04-03T11:40:43.3734348Z ##[debug]Result: 'diskspace_cutoff=70 2024-04-03T11:40:43.3735077Z ##[debug]diskspace=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T11:40:43.3736482Z ##[debug]msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T11:40:43.3737726Z ##[debug]if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3738319Z ##[debug] docker system prune -af 2024-04-03T11:40:43.3739049Z ##[debug] diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T11:40:43.3740016Z ##[debug] if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3741332Z ##[debug] echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-04-03T11:40:43.3742342Z ##[debug] echo "$msg" 2024-04-03T11:40:43.3742798Z ##[debug] exit 1 2024-04-03T11:40:43.3743221Z ##[debug] else 2024-04-03T11:40:43.3743744Z ##[debug] difference=$((diskspace - diskspace_new)) 2024-04-03T11:40:43.3744517Z ##[debug] echo "Diskspace saved: $difference percent" 2024-04-03T11:40:43.3745179Z ##[debug] fi 2024-04-03T11:40:43.3745561Z ##[debug]fi 2024-04-03T11:40:43.3745901Z ##[debug]' 2024-04-03T11:40:43.3746424Z ##[debug]Loading env 2024-04-03T11:40:43.3753462Z ##[group]Run diskspace_cutoff=70 2024-04-03T11:40:43.3754047Z diskspace_cutoff=70 2024-04-03T11:40:43.3754887Z diskspace=$(df -H / --output=pcent | sed -n 2p | sed 's/%//' | sed 's/ //') 2024-04-03T11:40:43.3756769Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T11:40:43.3758332Z if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3759087Z  docker system prune -af 2024-04-03T11:40:43.3760000Z  diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed 's/%//' | sed 's/ //') 2024-04-03T11:40:43.3760958Z  if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T11:40:43.3762009Z  echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-04-03T11:40:43.3762899Z  echo "$msg" 2024-04-03T11:40:43.3763309Z  exit 1 2024-04-03T11:40:43.3763673Z  else 2024-04-03T11:40:43.3764148Z  difference=$((diskspace - diskspace_new)) 2024-04-03T11:40:43.3764863Z  echo "Diskspace saved: $difference percent" 2024-04-03T11:40:43.3765496Z  fi 2024-04-03T11:40:43.3765829Z fi 2024-04-03T11:40:43.3800817Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:43.3801381Z env: 2024-04-03T11:40:43.3801691Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:43.3802158Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:43.3802657Z ##[endgroup] 2024-04-03T11:40:43.3832479Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/af438125-0a48-46b0-84b2-23adb92a6c78.sh 2024-04-03T11:40:43.3927672Z ##[debug]Finished: run 2024-04-03T11:40:43.3938456Z ##[debug]Finished: run 2024-04-03T11:40:43.3947738Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:43.3953213Z ##[debug]Evaluating: failure() 2024-04-03T11:40:43.3954312Z ##[debug]Evaluating failure: 2024-04-03T11:40:43.3959829Z ##[debug]=> false 2024-04-03T11:40:43.3961081Z ##[debug]Result: false 2024-04-03T11:40:43.3965659Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:43.3967766Z ##[debug]Evaluating: success() 2024-04-03T11:40:43.3968899Z ##[debug]Evaluating success: 2024-04-03T11:40:43.3969975Z ##[debug]=> true 2024-04-03T11:40:43.3971022Z ##[debug]Result: true 2024-04-03T11:40:43.3972242Z ##[debug]Starting: run 2024-04-03T11:40:43.3990870Z ##[debug]Loading inputs 2024-04-03T11:40:43.3992782Z ##[debug]Loading env 2024-04-03T11:40:43.4004161Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2024-04-03T11:40:43.4005687Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2024-04-03T11:40:43.4007104Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2024-04-03T11:40:43.4062057Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 
2024-04-03T11:40:43.4063113Z env: 2024-04-03T11:40:43.4063685Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:43.4064528Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:43.4065487Z ##[endgroup] 2024-04-03T11:40:43.4094066Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/ec8827e9-3cf8-4897-bad2-11310a778425.sh 2024-04-03T11:40:43.4196009Z ##[debug]Finished: run 2024-04-03T11:40:43.4202645Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:43.4205890Z ##[debug]Evaluating: success() 2024-04-03T11:40:43.4207448Z ##[debug]Evaluating success: 2024-04-03T11:40:43.4208964Z ##[debug]=> true 2024-04-03T11:40:43.4210424Z ##[debug]Result: true 2024-04-03T11:40:43.4212101Z ##[debug]Starting: run 2024-04-03T11:40:43.4237066Z ##[debug]Loading inputs 2024-04-03T11:40:43.4238923Z ##[debug]Loading env 2024-04-03T11:40:43.4250533Z ##[group]Run # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 2024-04-03T11:40:43.4252817Z # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 
2024-04-03T11:40:43.4255443Z echo "GPU_FLAG=--device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon" >> "${GITHUB_ENV}" 2024-04-03T11:40:43.4310062Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:43.4311139Z env: 2024-04-03T11:40:43.4311710Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:43.4312572Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:43.4313511Z ##[endgroup] 2024-04-03T11:40:43.4358306Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/04eaaf36-47e5-4794-915c-9421c612bd2f.sh 2024-04-03T11:40:43.4399153Z ##[debug]GPU_FLAG='--device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon' 2024-04-03T11:40:43.4405385Z ##[debug]Finished: run 2024-04-03T11:40:43.4414687Z ##[debug]Finishing: Setup ROCm 2024-04-03T11:40:43.4439074Z ##[debug]Evaluating condition for step: 'configure aws credentials' 2024-04-03T11:40:43.4442318Z ##[debug]Evaluating: success() 2024-04-03T11:40:43.4443741Z ##[debug]Evaluating success: 2024-04-03T11:40:43.4445149Z ##[debug]=> true 2024-04-03T11:40:43.4446498Z ##[debug]Result: true 2024-04-03T11:40:43.4448211Z ##[debug]Starting: configure aws credentials 2024-04-03T11:40:43.4495919Z ##[debug]Register post job cleanup for action: aws-actions/configure-aws-credentials@v4 2024-04-03T11:40:43.4511312Z ##[debug]Loading inputs 2024-04-03T11:40:43.4521568Z ##[debug]Loading env 2024-04-03T11:40:43.4528100Z ##[group]Run aws-actions/configure-aws-credentials@v4 2024-04-03T11:40:43.4528632Z with: 2024-04-03T11:40:43.4529174Z role-to-assume: arn:aws:iam::308535385114:role/gha_workflow_s3_and_ecr_read_only 2024-04-03T11:40:43.4529898Z aws-region: us-east-1 2024-04-03T11:40:43.4530306Z role-duration-seconds: 18000 2024-04-03T11:40:43.4530747Z audience: sts.amazonaws.com 2024-04-03T11:40:43.4531180Z env: 2024-04-03T11:40:43.4531482Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:43.4531944Z DOCKER_HOST: 
unix:///run/user/1123/docker.sock 2024-04-03T11:40:43.4532830Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:43.4533649Z ##[endgroup] 2024-04-03T11:40:43.6050319Z ##[debug]ID token url is https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/00000000-0000-0000-0000-000000000000/_apis/distributedtask/hubs/Actions/plans/479d079c-d5e7-4008-87ea-9a6b11046403/jobs/6794899a-c0c6-5bd7-6598-6c5a296c045c/idtoken?api-version=2.0&audience=sts.amazonaws.com 2024-04-03T11:40:43.9779541Z ::add-mask::*** 2024-04-03T11:40:43.9798262Z ##[debug]7 role session tags are being used. 2024-04-03T11:40:43.9846863Z Assuming role with OIDC 2024-04-03T11:40:44.4057522Z Authenticated as assumedRoleId AROAUPVRELQNLLCOPFEJR:GitHubActions 2024-04-03T11:40:44.4060895Z ::add-mask::*** 2024-04-03T11:40:44.4064018Z ::add-mask::*** 2024-04-03T11:40:44.4067920Z ::add-mask::*** 2024-04-03T11:40:44.5088056Z ##[debug]Node Action run completed with exit code 0 2024-04-03T11:40:44.5091836Z ##[debug]AWS_DEFAULT_REGION='us-east-1' 2024-04-03T11:40:44.5093306Z ##[debug]AWS_REGION='us-east-1' 2024-04-03T11:40:44.5094908Z ##[debug]AWS_ACCESS_KEY_ID='***' 2024-04-03T11:40:44.5097263Z ##[debug]AWS_SECRET_ACCESS_KEY='***' 2024-04-03T11:40:44.5119680Z ##[debug]AWS_SESSION_TOKEN='***' 2024-04-03T11:40:44.5166379Z ##[debug]Set output aws-account-id = 308535385114 2024-04-03T11:40:44.5168453Z ##[debug]Finishing: configure aws credentials 2024-04-03T11:40:44.5191289Z ##[debug]Evaluating condition for step: 'Login to Amazon ECR' 2024-04-03T11:40:44.5193935Z ##[debug]Evaluating: success() 2024-04-03T11:40:44.5195073Z ##[debug]Evaluating success: 2024-04-03T11:40:44.5196220Z ##[debug]=> true 2024-04-03T11:40:44.5197298Z ##[debug]Result: true 2024-04-03T11:40:44.5198649Z ##[debug]Starting: Login to Amazon ECR 2024-04-03T11:40:44.5232258Z ##[debug]Register post job cleanup for action: 
aws-actions/amazon-ecr-login@v2 2024-04-03T11:40:44.5257669Z ##[debug]Loading inputs 2024-04-03T11:40:44.5265907Z ##[debug]Loading env 2024-04-03T11:40:44.5277290Z ##[group]Run aws-actions/amazon-ecr-login@v2 2024-04-03T11:40:44.5278263Z with: 2024-04-03T11:40:44.5278868Z mask-password: true 2024-04-03T11:40:44.5279655Z registry-type: private 2024-04-03T11:40:44.5280424Z skip-logout: false 2024-04-03T11:40:44.5281136Z env: 2024-04-03T11:40:44.5281720Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:44.5282628Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:44.5284380Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:44.5286020Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:40:44.5286870Z AWS_REGION: us-east-1 2024-04-03T11:40:44.5287799Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:40:44.5288904Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:40:44.5304156Z AWS_SESSION_TOKEN: *** 2024-04-03T11:40:44.5304936Z ##[endgroup] 2024-04-03T11:40:45.1156756Z Logging into registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:45.8264426Z ::add-mask::*** 2024-04-03T11:40:45.8269108Z ##[debug]Your docker password is masked. 2024-04-03T11:40:45.8271767Z ##[debug]'skip-logout' is false for 1 registries. 
2024-04-03T11:40:45.8361720Z ##[debug]Node Action run completed with exit code 0 2024-04-03T11:40:45.8369358Z ##[debug]Save intra-action state registries = 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:45.8373260Z ##[debug]Set output registry = 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:45.8375986Z ##[debug]Set output docker_username_308535385114_dkr_ecr_us_east_1_amazonaws_com = AWS 2024-04-03T11:40:45.8564331Z ##[debug]Set output docker_password_308535385114_dkr_ecr_us_east_1_amazonaws_com = *** 2024-04-03T11:40:45.8565499Z ##[debug]Finishing: Login to Amazon ECR 2024-04-03T11:40:45.8576113Z ##[debug]Evaluating condition for step: 'Calculate docker image' 2024-04-03T11:40:45.8577923Z ##[debug]Evaluating: success() 2024-04-03T11:40:45.8578567Z ##[debug]Evaluating success: 2024-04-03T11:40:45.8579232Z ##[debug]=> true 2024-04-03T11:40:45.8580346Z ##[debug]Result: true 2024-04-03T11:40:45.8581158Z ##[debug]Starting: Calculate docker image 2024-04-03T11:40:45.8722221Z ##[debug]Loading inputs 2024-04-03T11:40:45.8723600Z ##[debug]Evaluating: inputs.docker-image 2024-04-03T11:40:45.8724147Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.8724608Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:45.8725068Z ##[debug]..=> Object 2024-04-03T11:40:45.8725500Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.8725965Z ##[debug]..=> 'docker-image' 2024-04-03T11:40:45.8727111Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:45.8728939Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:45.8734756Z ##[debug]Loading env 2024-04-03T11:40:45.8741306Z ##[group]Run pytorch/test-infra/.github/actions/calculate-docker-image@main 2024-04-03T11:40:45.8742015Z with: 2024-04-03T11:40:45.8743053Z docker-image-name: 
308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.8744479Z docker-build-dir: .ci/docker 2024-04-03T11:40:45.8744943Z working-directory: . 2024-04-03T11:40:45.8745514Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:45.8746160Z force-push: false 2024-04-03T11:40:45.8746537Z env: 2024-04-03T11:40:45.8746887Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:45.8747395Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:45.8748309Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:45.8749202Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:40:45.8749668Z AWS_REGION: us-east-1 2024-04-03T11:40:45.8750176Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:40:45.8750780Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:40:45.8758773Z AWS_SESSION_TOKEN: *** 2024-04-03T11:40:45.8759201Z ##[endgroup] 2024-04-03T11:40:45.8763093Z ##[debug]Evaluating: github.event.repository.name 2024-04-03T11:40:45.8763691Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.8764163Z ##[debug]..Evaluating Index: 2024-04-03T11:40:45.8764630Z ##[debug]....Evaluating Index: 2024-04-03T11:40:45.8765103Z ##[debug]......Evaluating github: 2024-04-03T11:40:45.8765600Z ##[debug]......=> Object 2024-04-03T11:40:45.8766054Z ##[debug]......Evaluating String: 2024-04-03T11:40:45.8766543Z ##[debug]......=> 'event' 2024-04-03T11:40:45.8766991Z ##[debug]....=> Object 2024-04-03T11:40:45.8767426Z ##[debug]....Evaluating String: 2024-04-03T11:40:45.8767903Z ##[debug]....=> 'repository' 2024-04-03T11:40:45.8768373Z ##[debug]..=> Object 2024-04-03T11:40:45.8768809Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.8769265Z ##[debug]..=> 'name' 2024-04-03T11:40:45.8769684Z ##[debug]=> 'pytorch' 2024-04-03T11:40:45.8770121Z ##[debug]Result: 'pytorch' 2024-04-03T11:40:45.8770818Z ##[debug]Evaluating: inputs.docker-image-name 
2024-04-03T11:40:45.8771386Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.8771846Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:45.8772541Z ##[debug]..=> Object 2024-04-03T11:40:45.8772979Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.8773460Z ##[debug]..=> 'docker-image-name' 2024-04-03T11:40:45.8774587Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:45.8776381Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:45.8777883Z ##[debug]Evaluating: inputs.docker-build-dir 2024-04-03T11:40:45.8778644Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.8779129Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:45.8779597Z ##[debug]..=> Object 2024-04-03T11:40:45.8780086Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.8780571Z ##[debug]..=> 'docker-build-dir' 2024-04-03T11:40:45.8781154Z ##[debug]=> '.ci/docker' 2024-04-03T11:40:45.8781691Z ##[debug]Result: '.ci/docker' 2024-04-03T11:40:45.8782551Z ##[debug]Evaluating: inputs.docker-registry 2024-04-03T11:40:45.8783216Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.8783752Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:45.8784310Z ##[debug]..=> Object 2024-04-03T11:40:45.8784810Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.8785384Z ##[debug]..=> 'docker-registry' 2024-04-03T11:40:45.8786110Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com' 2024-04-03T11:40:45.8787035Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com' 2024-04-03T11:40:45.8787982Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:45.8789380Z ##[debug]Evaluating: success() 2024-04-03T11:40:45.8790003Z ##[debug]Evaluating success: 2024-04-03T11:40:45.8790637Z ##[debug]=> true 2024-04-03T11:40:45.8791229Z ##[debug]Result: true 2024-04-03T11:40:45.8791915Z ##[debug]Starting: run 
2024-04-03T11:40:45.8801599Z ##[debug]Loading inputs 2024-04-03T11:40:45.8802834Z ##[debug]Evaluating: inputs.working-directory 2024-04-03T11:40:45.8803408Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.8803871Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:45.8804537Z ##[debug]..=> Object 2024-04-03T11:40:45.8804964Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.8805454Z ##[debug]..=> 'working-directory' 2024-04-03T11:40:45.8805969Z ##[debug]=> '.' 2024-04-03T11:40:45.8806360Z ##[debug]Result: '.' 2024-04-03T11:40:45.8806845Z ##[debug]Loading env 2024-04-03T11:40:45.8813139Z ##[group]Run set -ex 2024-04-03T11:40:45.8813587Z set -ex 2024-04-03T11:40:45.8813961Z  2024-04-03T11:40:45.8814663Z # If the docker build directory or the build script doesn't exist, the action will 2024-04-03T11:40:45.8815833Z # gracefully return the docker image name as it is. Pulling docker image in Linux 2024-04-03T11:40:45.8816854Z # job could then download the pre-built image as usual 2024-04-03T11:40:45.8817785Z if [[ ! -d "${DOCKER_BUILD_DIR}" ]] || [[ ! -f "${DOCKER_BUILD_DIR}/build.sh" ]]; then 2024-04-03T11:40:45.8818625Z  echo "skip=true" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:45.8819422Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:45.8820211Z  2024-04-03T11:40:45.8820859Z  echo "There is no Docker build script in ${REPO_NAME} repo, skipping..." 
2024-04-03T11:40:45.8821639Z  exit 0 2024-04-03T11:40:45.8822021Z else 2024-04-03T11:40:45.8822480Z  echo "skip=false" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:45.8823046Z fi 2024-04-03T11:40:45.8823395Z  2024-04-03T11:40:45.8823992Z if [[ "${DOCKER_IMAGE_NAME}" == *"${DOCKER_REGISTRY}/${REPO_NAME}"* ]]; then 2024-04-03T11:40:45.8825040Z  # The docker image name already includes the ECR prefix and tag, so we can just 2024-04-03T11:40:45.8826006Z  # use it as it is, but first let's extract the tag 2024-04-03T11:40:45.8826878Z  DOCKER_TAG=$(echo "${DOCKER_IMAGE_NAME}" | awk -F '[:,]' '{print $2}') 2024-04-03T11:40:45.8827783Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:45.8828659Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:45.8829358Z else 2024-04-03T11:40:45.8829908Z  DOCKER_TAG=$(git rev-parse HEAD:"${DOCKER_BUILD_DIR}") 2024-04-03T11:40:45.8830735Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:45.8831836Z  echo "docker-image=${DOCKER_REGISTRY}/${REPO_NAME}/${DOCKER_IMAGE_NAME}:${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:45.8832911Z fi 2024-04-03T11:40:45.8867548Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:45.8868156Z env: 2024-04-03T11:40:45.8868500Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:45.8869010Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:45.8869941Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:45.8870828Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:40:45.8871312Z AWS_REGION: us-east-1 2024-04-03T11:40:45.8871851Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:40:45.8872458Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:40:45.8880422Z AWS_SESSION_TOKEN: *** 2024-04-03T11:40:45.8880855Z REPO_NAME: pytorch 2024-04-03T11:40:45.8881930Z DOCKER_IMAGE_NAME: 
308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.8883109Z DOCKER_BUILD_DIR: .ci/docker 2024-04-03T11:40:45.8883727Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:45.8884352Z ##[endgroup] 2024-04-03T11:40:45.8911761Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/1ece4e3e-7d96-46b3-b1ed-d50480bd2c31.sh 2024-04-03T11:40:45.8944263Z + [[ ! -d .ci/docker ]] 2024-04-03T11:40:45.8945688Z + [[ ! -f .ci/docker/build.sh ]] 2024-04-03T11:40:45.8946971Z + echo skip=false 2024-04-03T11:40:45.8952011Z + [[ 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 == *\3\0\8\5\3\5\3\8\5\1\1\4\.\d\k\r\.\e\c\r\.\u\s\-\e\a\s\t\-\1\.\a\m\a\z\o\n\a\w\s\.\c\o\m\/\p\y\t\o\r\c\h* ]] 2024-04-03T11:40:45.8959507Z ++ echo 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.8962571Z ++ awk -F '[:,]' '{print $2}' 2024-04-03T11:40:45.8997651Z + DOCKER_TAG=16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.8999966Z + echo docker-tag=16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.9003961Z + echo docker-image=308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.9014116Z ##[debug]Set output skip = false 2024-04-03T11:40:45.9015961Z ##[debug]Set output docker-tag = 16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.9019537Z ##[debug]Set output docker-image = 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.9023803Z ##[debug]Finished: run 2024-04-03T11:40:45.9031505Z ##[debug]Evaluating: inputs.docker-build-dir 2024-04-03T11:40:45.9032693Z 
##[debug]Evaluating Index: 2024-04-03T11:40:45.9033640Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:45.9034606Z ##[debug]..=> Object 2024-04-03T11:40:45.9035481Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.9036460Z ##[debug]..=> 'docker-build-dir' 2024-04-03T11:40:45.9037503Z ##[debug]=> '.ci/docker' 2024-04-03T11:40:45.9038409Z ##[debug]Result: '.ci/docker' 2024-04-03T11:40:45.9040451Z ##[debug]Evaluating: (github.event.pull_request.base.sha || github.sha) 2024-04-03T11:40:45.9041961Z ##[debug]Evaluating Or: 2024-04-03T11:40:45.9042830Z ##[debug]..Evaluating Index: 2024-04-03T11:40:45.9043746Z ##[debug]....Evaluating Index: 2024-04-03T11:40:45.9044681Z ##[debug]......Evaluating Index: 2024-04-03T11:40:45.9045652Z ##[debug]........Evaluating Index: 2024-04-03T11:40:45.9046671Z ##[debug]..........Evaluating github: 2024-04-03T11:40:45.9047732Z ##[debug]..........=> Object 2024-04-03T11:40:45.9048675Z ##[debug]..........Evaluating String: 2024-04-03T11:40:45.9049670Z ##[debug]..........=> 'event' 2024-04-03T11:40:45.9050587Z ##[debug]........=> Object 2024-04-03T11:40:45.9051510Z ##[debug]........Evaluating String: 2024-04-03T11:40:45.9052520Z ##[debug]........=> 'pull_request' 2024-04-03T11:40:45.9053474Z ##[debug]......=> null 2024-04-03T11:40:45.9054294Z ##[debug]....=> null 2024-04-03T11:40:45.9055435Z ##[debug]..=> null 2024-04-03T11:40:45.9056264Z ##[debug]..Evaluating Index: 2024-04-03T11:40:45.9057174Z ##[debug]....Evaluating github: 2024-04-03T11:40:45.9058093Z ##[debug]....=> Object 2024-04-03T11:40:45.9058934Z ##[debug]....Evaluating String: 2024-04-03T11:40:45.9059988Z ##[debug]....=> 'sha' 2024-04-03T11:40:45.9061115Z ##[debug]..=> 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:40:45.9062596Z ##[debug]=> 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:40:45.9064313Z ##[debug]Expanded: (null || 'd253bd7befc64137de913dbb8727b511f1052411') 2024-04-03T11:40:45.9066014Z ##[debug]Result: 'd253bd7befc64137de913dbb8727b511f1052411' 
2024-04-03T11:40:45.9068057Z ##[debug]Evaluating: steps.calculate-image.outputs.docker-image 2024-04-03T11:40:45.9069424Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.9070335Z ##[debug]..Evaluating Index: 2024-04-03T11:40:45.9071266Z ##[debug]....Evaluating Index: 2024-04-03T11:40:45.9072218Z ##[debug]......Evaluating steps: 2024-04-03T11:40:45.9073192Z ##[debug]......=> Object 2024-04-03T11:40:45.9074081Z ##[debug]......Evaluating String: 2024-04-03T11:40:45.9075084Z ##[debug]......=> 'calculate-image' 2024-04-03T11:40:45.9076083Z ##[debug]....=> Object 2024-04-03T11:40:45.9076936Z ##[debug]....Evaluating String: 2024-04-03T11:40:45.9077880Z ##[debug]....=> 'outputs' 2024-04-03T11:40:45.9078741Z ##[debug]..=> Object 2024-04-03T11:40:45.9079567Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.9080816Z ##[debug]..=> 'docker-image' 2024-04-03T11:40:45.9083009Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:45.9086648Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:45.9089696Z ##[debug]Evaluating: steps.calculate-image.outputs.docker-tag 2024-04-03T11:40:45.9091018Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.9091942Z ##[debug]..Evaluating Index: 2024-04-03T11:40:45.9092865Z ##[debug]....Evaluating Index: 2024-04-03T11:40:45.9093813Z ##[debug]......Evaluating steps: 2024-04-03T11:40:45.9094763Z ##[debug]......=> Object 2024-04-03T11:40:45.9095654Z ##[debug]......Evaluating String: 2024-04-03T11:40:45.9096657Z ##[debug]......=> 'calculate-image' 2024-04-03T11:40:45.9097653Z ##[debug]....=> Object 2024-04-03T11:40:45.9098499Z ##[debug]....Evaluating String: 2024-04-03T11:40:45.9099507Z ##[debug]....=> 'outputs' 2024-04-03T11:40:45.9100563Z ##[debug]..=> Object 2024-04-03T11:40:45.9101419Z ##[debug]..Evaluating String: 
2024-04-03T11:40:45.9102385Z ##[debug]..=> 'docker-tag' 2024-04-03T11:40:45.9103584Z ##[debug]=> '16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:45.9105401Z ##[debug]Result: '16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:45.9107191Z ##[debug]Evaluating: inputs.docker-registry 2024-04-03T11:40:45.9108349Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.9109442Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:45.9110406Z ##[debug]..=> Object 2024-04-03T11:40:45.9111268Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.9112243Z ##[debug]..=> 'docker-registry' 2024-04-03T11:40:45.9113473Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com' 2024-04-03T11:40:45.9115043Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com' 2024-04-03T11:40:45.9116667Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:45.9124353Z ##[debug]Evaluating: (success() && (steps.calculate-image.outputs.skip != 'true') && !inputs.always-rebuild) 2024-04-03T11:40:45.9126572Z ##[debug]Evaluating And: 2024-04-03T11:40:45.9127834Z ##[debug]..Evaluating success: 2024-04-03T11:40:45.9129000Z ##[debug]..=> true 2024-04-03T11:40:45.9130105Z ##[debug]..Evaluating NotEqual: 2024-04-03T11:40:45.9133018Z ##[debug]....Evaluating Index: 2024-04-03T11:40:45.9134194Z ##[debug]......Evaluating Index: 2024-04-03T11:40:45.9135357Z ##[debug]........Evaluating Index: 2024-04-03T11:40:45.9136869Z ##[debug]..........Evaluating steps: 2024-04-03T11:40:45.9138086Z ##[debug]..........=> Object 2024-04-03T11:40:45.9139255Z ##[debug]..........Evaluating String: 2024-04-03T11:40:45.9140832Z ##[debug]..........=> 'calculate-image' 2024-04-03T11:40:45.9142059Z ##[debug]........=> Object 2024-04-03T11:40:45.9143171Z ##[debug]........Evaluating String: 2024-04-03T11:40:45.9144325Z ##[debug]........=> 'outputs' 2024-04-03T11:40:45.9145447Z ##[debug]......=> Object 2024-04-03T11:40:45.9146558Z ##[debug]......Evaluating String: 2024-04-03T11:40:45.9147666Z ##[debug]......=> 
'skip' 2024-04-03T11:40:45.9148466Z ##[debug]....=> 'false' 2024-04-03T11:40:45.9149061Z ##[debug]....Evaluating String: 2024-04-03T11:40:45.9149637Z ##[debug]....=> 'true' 2024-04-03T11:40:45.9150633Z ##[debug]..=> true 2024-04-03T11:40:45.9151405Z ##[debug]..Evaluating Not: 2024-04-03T11:40:45.9152657Z ##[debug]....Evaluating Index: 2024-04-03T11:40:45.9153268Z ##[debug]......Evaluating inputs: 2024-04-03T11:40:45.9153867Z ##[debug]......=> Object 2024-04-03T11:40:45.9154457Z ##[debug]......Evaluating String: 2024-04-03T11:40:45.9155085Z ##[debug]......=> 'always-rebuild' 2024-04-03T11:40:45.9155680Z ##[debug]....=> '' 2024-04-03T11:40:45.9156183Z ##[debug]..=> true 2024-04-03T11:40:45.9156901Z ##[debug]=> true 2024-04-03T11:40:45.9159490Z ##[debug]Expanded: (true && ('false' != 'true') && !'') 2024-04-03T11:40:45.9160258Z ##[debug]Result: true 2024-04-03T11:40:45.9160947Z ##[debug]Starting: run 2024-04-03T11:40:45.9172441Z ##[debug]Loading inputs 2024-04-03T11:40:45.9173687Z ##[debug]Evaluating: inputs.working-directory 2024-04-03T11:40:45.9174253Z ##[debug]Evaluating Index: 2024-04-03T11:40:45.9174702Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:45.9175162Z ##[debug]..=> Object 2024-04-03T11:40:45.9175580Z ##[debug]..Evaluating String: 2024-04-03T11:40:45.9176055Z ##[debug]..=> 'working-directory' 2024-04-03T11:40:45.9176556Z ##[debug]=> '.' 2024-04-03T11:40:45.9176934Z ##[debug]Result: '.' 
2024-04-03T11:40:45.9177401Z ##[debug]Loading env 2024-04-03T11:40:45.9183701Z ##[group]Run set +e 2024-04-03T11:40:45.9184124Z set +e 2024-04-03T11:40:45.9184484Z set -x 2024-04-03T11:40:45.9184835Z  2024-04-03T11:40:45.9185168Z login() { 2024-04-03T11:40:45.9185957Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2024-04-03T11:40:45.9186831Z } 2024-04-03T11:40:45.9187164Z  2024-04-03T11:40:45.9187498Z retry () { 2024-04-03T11:40:45.9187971Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2024-04-03T11:40:45.9188508Z } 2024-04-03T11:40:45.9188837Z  2024-04-03T11:40:45.9189219Z retry login "${DOCKER_REGISTRY}" 2024-04-03T11:40:45.9189719Z  2024-04-03T11:40:45.9190291Z # Check if image already exists, if it does then skip building it 2024-04-03T11:40:45.9191150Z if docker manifest inspect "${DOCKER_IMAGE}"; then 2024-04-03T11:40:45.9191766Z  exit 0 2024-04-03T11:40:45.9192128Z fi 2024-04-03T11:40:45.9192461Z  2024-04-03T11:40:45.9193070Z # NB: This part requires a full checkout. Otherwise, the merge base will 2024-04-03T11:40:45.9194082Z # be empty. The default action would be to continue rebuild the image 2024-04-03T11:40:45.9194980Z if [[ "$BASE_REVISION" = "$(git rev-parse HEAD)" ]]; then 2024-04-03T11:40:45.9195794Z  # if we're on the base branch then use the parent commit 2024-04-03T11:40:45.9196520Z  MERGE_BASE=$(git rev-parse HEAD~) 2024-04-03T11:40:45.9197038Z else 2024-04-03T11:40:45.9197603Z  # otherwise we're on a PR, so use the most recent base commit 2024-04-03T11:40:45.9198444Z  MERGE_BASE=$(git merge-base HEAD "$BASE_REVISION") 2024-04-03T11:40:45.9199053Z fi 2024-04-03T11:40:45.9199387Z  2024-04-03T11:40:45.9199773Z if [[ -z "${MERGE_BASE}" ]]; then 2024-04-03T11:40:45.9200497Z  echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:45.9201054Z  2024-04-03T11:40:45.9201842Z  echo "Finding merge base only works with full checkout, please set fetch-depth to 0, continuing ..." 
2024-04-03T11:40:45.9202764Z  exit 0 2024-04-03T11:40:45.9203123Z fi 2024-04-03T11:40:45.9203453Z  2024-04-03T11:40:45.9203980Z if ! git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}"; then 2024-04-03T11:40:45.9205170Z  echo "Directory '${DOCKER_BUILD_DIR}' not found in commit $MERGE_BASE, you should rebase onto a more recent commit" 2024-04-03T11:40:45.9206148Z  exit 1 2024-04-03T11:40:45.9206507Z fi 2024-04-03T11:40:45.9206837Z  2024-04-03T11:40:45.9207426Z PREVIOUS_DOCKER_TAG=$(git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}") 2024-04-03T11:40:45.9208585Z # If no image exists but the hash is the same as the previous hash then we should error out here 2024-04-03T11:40:45.9209651Z if [[ "${PREVIOUS_DOCKER_TAG}" == "${DOCKER_TAG}" ]]; then 2024-04-03T11:40:45.9210819Z  echo "WARNING: Something has gone wrong and the previous image isn't available for the merge-base of your branch" 2024-04-03T11:40:45.9212164Z  echo " Will re-build docker image to store in local cache, TTS may be longer" 2024-04-03T11:40:45.9212957Z fi 2024-04-03T11:40:45.9213289Z  2024-04-03T11:40:45.9213712Z echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:45.9246753Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:45.9247345Z env: 2024-04-03T11:40:45.9247673Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:45.9248164Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:45.9249073Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:45.9249943Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:40:45.9250404Z AWS_REGION: us-east-1 2024-04-03T11:40:45.9250932Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:40:45.9251527Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:40:45.9259504Z AWS_SESSION_TOKEN: *** 2024-04-03T11:40:45.9260112Z DOCKER_BUILD_DIR: .ci/docker 2024-04-03T11:40:45.9260672Z BASE_REVISION: d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:40:45.9261920Z DOCKER_IMAGE: 
308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.9263145Z DOCKER_TAG: 16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:45.9263851Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:45.9264465Z ##[endgroup] 2024-04-03T11:40:45.9291485Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/0bb6e8db-23a2-4e5c-ab99-b59b225397cb.sh 2024-04-03T11:40:45.9325252Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:45.9327339Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:45.9331401Z + aws ecr get-login-password --region us-east-1 2024-04-03T11:40:45.9334107Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:47.3257536Z WARNING! Your password will be stored unencrypted in /home/pytorchci/.docker/config.json. 2024-04-03T11:40:47.3260441Z Configure a credential helper to remove this warning. 
See 2024-04-03T11:40:47.3263443Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2024-04-03T11:40:47.3265226Z 2024-04-03T11:40:47.3265643Z Login Succeeded 2024-04-03T11:40:47.3310405Z + docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:48.8901874Z { 2024-04-03T11:40:48.8903070Z "schemaVersion": 2, 2024-04-03T11:40:48.8904900Z "mediaType": "application/vnd.docker.distribution.manifest.v2+json", 2024-04-03T11:40:48.8906921Z "config": { 2024-04-03T11:40:48.8909215Z "mediaType": "application/vnd.docker.container.image.v1+json", 2024-04-03T11:40:48.8911135Z "size": 20924, 2024-04-03T11:40:48.8913015Z "digest": "sha256:2da876c634db218fb493be47975c50245dfa4aace559dcf6d45fc2f2bf0ebf9f" 2024-04-03T11:40:48.8915156Z }, 2024-04-03T11:40:48.8916032Z "layers": [ 2024-04-03T11:40:48.8916964Z { 2024-04-03T11:40:48.8918431Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.8920757Z "size": 28584317, 2024-04-03T11:40:48.8922638Z "digest": "sha256:63e9bbe323274e77e58d77c6ab6802d247458f784222fbb07a2556d6ec74ee05" 2024-04-03T11:40:48.8924713Z }, 2024-04-03T11:40:48.8925564Z { 2024-04-03T11:40:48.8927028Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.8928924Z "size": 1827, 2024-04-03T11:40:48.8930776Z "digest": "sha256:020f47d0fa1be3a2df8dd2e863841d351ab72d00c7df4a4a16879ceea1239f4b" 2024-04-03T11:40:48.8932907Z }, 2024-04-03T11:40:48.8933756Z { 2024-04-03T11:40:48.8935236Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.8937151Z "size": 312536689, 2024-04-03T11:40:48.8939031Z "digest": "sha256:75256bc499c110e379a88a45dfa3d745edc453bf22af7e437a3439a48477cb30" 2024-04-03T11:40:48.8941593Z }, 2024-04-03T11:40:48.8942451Z { 2024-04-03T11:40:48.8943921Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 
2024-04-03T11:40:48.8945811Z "size": 802, 2024-04-03T11:40:48.8947547Z "digest": "sha256:5e6597056347c056681d9213c22a37f96446cb7b0d9b6838b5e6101ac98c7e8a" 2024-04-03T11:40:48.8949992Z }, 2024-04-03T11:40:48.8950842Z { 2024-04-03T11:40:48.8952283Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.8954391Z "size": 106, 2024-04-03T11:40:48.8956662Z "digest": "sha256:39f9553cbd3080af50567b0579d8446f7df6aaed7ea18cc5fd1fe81a6a32ec59" 2024-04-03T11:40:48.8958761Z }, 2024-04-03T11:40:48.8959595Z { 2024-04-03T11:40:48.8961047Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.8962937Z "size": 544, 2024-04-03T11:40:48.8964697Z "digest": "sha256:0e1508d7c9ab1501f0c741f4729a53eb3758856d8761c5258f732506d73c594e" 2024-04-03T11:40:48.8966746Z }, 2024-04-03T11:40:48.8967586Z { 2024-04-03T11:40:48.8969032Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.8970933Z "size": 1280, 2024-04-03T11:40:48.8972740Z "digest": "sha256:41da3b9e5a92a4819d7d1ec780621eec339941e9a6a7abf909cbca2b5349acd9" 2024-04-03T11:40:48.8974855Z }, 2024-04-03T11:40:48.8975690Z { 2024-04-03T11:40:48.8977150Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.8979023Z "size": 3257, 2024-04-03T11:40:48.8980962Z "digest": "sha256:39a0280ee9ffc8f17651a2faad05bb2e458d28cb725dd342726890d10ef20610" 2024-04-03T11:40:48.8983093Z }, 2024-04-03T11:40:48.8983937Z { 2024-04-03T11:40:48.8985420Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.8987316Z "size": 1904, 2024-04-03T11:40:48.8989126Z "digest": "sha256:a40a390f581ad70eca936ae2fe3aeadb7e19642cd6211232d77be76ba9fa7dd2" 2024-04-03T11:40:48.8991236Z }, 2024-04-03T11:40:48.8992075Z { 2024-04-03T11:40:48.8993522Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.8995397Z "size": 699, 2024-04-03T11:40:48.8997256Z "digest": 
"sha256:01d4a84baddbd86b6da32efdf2e97a1088a27be6c25e024af3e1bc20f1db320e" 2024-04-03T11:40:48.8999433Z }, 2024-04-03T11:40:48.9000272Z { 2024-04-03T11:40:48.9001722Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9003611Z "size": 2444293878, 2024-04-03T11:40:48.9005516Z "digest": "sha256:ba56e9f617fd9b3f17c36073f41b192ab5ecacb377c7c3db86ad2b73624532a3" 2024-04-03T11:40:48.9007633Z }, 2024-04-03T11:40:48.9008473Z { 2024-04-03T11:40:48.9009928Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9011842Z "size": 380, 2024-04-03T11:40:48.9013923Z "digest": "sha256:3eaa6199e98399217ecd8fe5727b666c8273744f0518bc3e5b1ac140ce3d123c" 2024-04-03T11:40:48.9016017Z }, 2024-04-03T11:40:48.9016847Z { 2024-04-03T11:40:48.9018297Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9020566Z "size": 11113, 2024-04-03T11:40:48.9022387Z "digest": "sha256:72b9cb4426d36f506b4e09553107e363ded2de2c2b8e7d77b852d7f2e26ac72e" 2024-04-03T11:40:48.9024468Z }, 2024-04-03T11:40:48.9025318Z { 2024-04-03T11:40:48.9026772Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9028647Z "size": 503, 2024-04-03T11:40:48.9030408Z "digest": "sha256:9cb381d49846ff5212226715691aa2c6abc9987a95b92335e81a6d03d9c4f944" 2024-04-03T11:40:48.9032449Z }, 2024-04-03T11:40:48.9032918Z { 2024-04-03T11:40:48.9033510Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9034183Z "size": 121477661, 2024-04-03T11:40:48.9034859Z "digest": "sha256:95478f636bc2f9e67687eba04f0e1194ed3bfe16fb5a2d8302342a7be969e74e" 2024-04-03T11:40:48.9035587Z }, 2024-04-03T11:40:48.9035896Z { 2024-04-03T11:40:48.9036421Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9037091Z "size": 109, 2024-04-03T11:40:48.9037738Z "digest": "sha256:5713988df83f1fa6e795a6df0ae991fad2bfaf3c1fcc7d19d4f6266bbeb812cf" 2024-04-03T11:40:48.9038490Z }, 
2024-04-03T11:40:48.9038797Z { 2024-04-03T11:40:48.9039453Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9040124Z "size": 557, 2024-04-03T11:40:48.9040765Z "digest": "sha256:b4f17b88d23ef498cfc2839ad0358588aaca3004f277c84f3ec7d68336482c1e" 2024-04-03T11:40:48.9041501Z }, 2024-04-03T11:40:48.9041806Z { 2024-04-03T11:40:48.9042323Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9042993Z "size": 1146788, 2024-04-03T11:40:48.9043661Z "digest": "sha256:dec6c3f0b4d635f0e1e48d69f7687aa6dba43fb9ba35d52203a77fe96b7e955a" 2024-04-03T11:40:48.9044424Z }, 2024-04-03T11:40:48.9044732Z { 2024-04-03T11:40:48.9045271Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9045940Z "size": 103, 2024-04-03T11:40:48.9046587Z "digest": "sha256:4fa024201f9dace570b791623cc528debcc6de0031ac5d46f6f25ccfa60c1329" 2024-04-03T11:40:48.9047325Z }, 2024-04-03T11:40:48.9047631Z { 2024-04-03T11:40:48.9048154Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9048828Z "size": 1489, 2024-04-03T11:40:48.9049486Z "digest": "sha256:0fbc7e84a570fa00c6b12df3ffc46a2b4ff2786d9e5564886359fe90b556556d" 2024-04-03T11:40:48.9050234Z }, 2024-04-03T11:40:48.9050540Z { 2024-04-03T11:40:48.9051058Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9051733Z "size": 442811274, 2024-04-03T11:40:48.9052407Z "digest": "sha256:cf4bd8ef6194084cfc8cc2d621e3d8489a25e08cedd600cced8b2908b81a27ab" 2024-04-03T11:40:48.9053158Z }, 2024-04-03T11:40:48.9053472Z { 2024-04-03T11:40:48.9053992Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9054663Z "size": 160, 2024-04-03T11:40:48.9055305Z "digest": "sha256:88083474f81ba7c96d89f785be1ced4ad23d3389eacd70de46232a5cff6484a6" 2024-04-03T11:40:48.9056043Z }, 2024-04-03T11:40:48.9056347Z { 2024-04-03T11:40:48.9056870Z "mediaType": 
"application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9057546Z "size": 1880, 2024-04-03T11:40:48.9058195Z "digest": "sha256:226d5ffc9b8ead0054250cfe6b21382e6d193618f2a7801bc0b8fd8ad757ad50" 2024-04-03T11:40:48.9058933Z }, 2024-04-03T11:40:48.9059240Z { 2024-04-03T11:40:48.9059803Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9060484Z "size": 6027431235, 2024-04-03T11:40:48.9061154Z "digest": "sha256:36ee3c5f091c5523f9a8d3f42c5754be5a3b3636f45d7c7db5047067df27cc43" 2024-04-03T11:40:48.9061893Z }, 2024-04-03T11:40:48.9062200Z { 2024-04-03T11:40:48.9062832Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9063512Z "size": 105, 2024-04-03T11:40:48.9064142Z "digest": "sha256:8c991bcb055736f6ccd1372e9fb6146d8335370660929792e401886813745edd" 2024-04-03T11:40:48.9064861Z }, 2024-04-03T11:40:48.9065173Z { 2024-04-03T11:40:48.9065710Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9066380Z "size": 840, 2024-04-03T11:40:48.9067025Z "digest": "sha256:f5a36972c0f3bf9be222ed5ce3f8480a4e86a861f8698c55e83ef85c098c6990" 2024-04-03T11:40:48.9067762Z }, 2024-04-03T11:40:48.9068073Z { 2024-04-03T11:40:48.9068592Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9069262Z "size": 324088466, 2024-04-03T11:40:48.9069926Z "digest": "sha256:af429abc069001e3b1f6cc2e120ed63e9996da47862b03803d04d860118298ea" 2024-04-03T11:40:48.9070668Z }, 2024-04-03T11:40:48.9070975Z { 2024-04-03T11:40:48.9071501Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9072168Z "size": 107, 2024-04-03T11:40:48.9072803Z "digest": "sha256:020a310c3fde4ccd6369b8538b4fa62a06948f0b38421343b4a2028d690fe94b" 2024-04-03T11:40:48.9073536Z }, 2024-04-03T11:40:48.9073841Z { 2024-04-03T11:40:48.9074358Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9075030Z "size": 564, 
2024-04-03T11:40:48.9075773Z "digest": "sha256:c584774f42519400f1d08734f82c70984be83c66222c2bcf7b6d6703895f5c79" 2024-04-03T11:40:48.9076493Z }, 2024-04-03T11:40:48.9076803Z { 2024-04-03T11:40:48.9077324Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9077998Z "size": 43192942, 2024-04-03T11:40:48.9078652Z "digest": "sha256:686e6512fd575c3f07f5bc23cda488f6bb7272f196e3de8b0307f683798312af" 2024-04-03T11:40:48.9079387Z }, 2024-04-03T11:40:48.9079696Z { 2024-04-03T11:40:48.9080223Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9080889Z "size": 106, 2024-04-03T11:40:48.9081531Z "digest": "sha256:dfd193f5fa4da2028a19b80e683da1533c60e82c9103c267177d5a3fdcad49c1" 2024-04-03T11:40:48.9082269Z }, 2024-04-03T11:40:48.9082580Z { 2024-04-03T11:40:48.9083095Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9083762Z "size": 293, 2024-04-03T11:40:48.9084403Z "digest": "sha256:3d8c7f18c87fd97d0c28231750bcd2438b97d2ec2bd2d1597bf43ba8b7179bea" 2024-04-03T11:40:48.9085153Z }, 2024-04-03T11:40:48.9085469Z { 2024-04-03T11:40:48.9086000Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9086664Z "size": 88293, 2024-04-03T11:40:48.9087302Z "digest": "sha256:4b38534173e8835e0566d5733265b84fae31743a127cfdec64fced43152ef405" 2024-04-03T11:40:48.9088027Z }, 2024-04-03T11:40:48.9088329Z { 2024-04-03T11:40:48.9088856Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9089536Z "size": 106, 2024-04-03T11:40:48.9090202Z "digest": "sha256:0befffeaba97bacdbd851934fc70bb45767ff56ab704c15f83e15d8866e7d4ca" 2024-04-03T11:40:48.9090954Z }, 2024-04-03T11:40:48.9091262Z { 2024-04-03T11:40:48.9091788Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9092460Z "size": 1192, 2024-04-03T11:40:48.9093089Z "digest": "sha256:f2934ba32628883717478385639d4c5f0403a09363b8cf2bb73b328e3b8cc042" 
2024-04-03T11:40:48.9093820Z }, 2024-04-03T11:40:48.9094130Z { 2024-04-03T11:40:48.9094652Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9095323Z "size": 699, 2024-04-03T11:40:48.9095981Z "digest": "sha256:01d4a84baddbd86b6da32efdf2e97a1088a27be6c25e024af3e1bc20f1db320e" 2024-04-03T11:40:48.9096736Z }, 2024-04-03T11:40:48.9097043Z { 2024-04-03T11:40:48.9097565Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9098236Z "size": 139, 2024-04-03T11:40:48.9098972Z "digest": "sha256:fc53e056e149a266192b06ba1b66e73c6190f0defcec4695a4e5774198b9d401" 2024-04-03T11:40:48.9099766Z }, 2024-04-03T11:40:48.9100075Z { 2024-04-03T11:40:48.9100597Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9101271Z "size": 119, 2024-04-03T11:40:48.9101912Z "digest": "sha256:9c3c29d675a6f3566456d0b187f18d1a5deac38949fea55e068dbb7bc69d8bb4" 2024-04-03T11:40:48.9102652Z }, 2024-04-03T11:40:48.9102972Z { 2024-04-03T11:40:48.9103494Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9104170Z "size": 1155940846, 2024-04-03T11:40:48.9104829Z "digest": "sha256:c3a7f0be00c0959a5e543095ecea091c1381495088568c606589d26265c32a36" 2024-04-03T11:40:48.9105561Z }, 2024-04-03T11:40:48.9105875Z { 2024-04-03T11:40:48.9106406Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9107078Z "size": 178, 2024-04-03T11:40:48.9107741Z "digest": "sha256:0f8ff1385f0da423e5ebc72da58f634beed7c2782aca2098173bbb5c7ac51ea0" 2024-04-03T11:40:48.9108489Z }, 2024-04-03T11:40:48.9108793Z { 2024-04-03T11:40:48.9109323Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9109991Z "size": 1840, 2024-04-03T11:40:48.9110640Z "digest": "sha256:d39766ef1e6b8b4b6e2ae3d7d9f23f104b3c0309eefbd8bb4e5d3e362937299a" 2024-04-03T11:40:48.9111381Z }, 2024-04-03T11:40:48.9111689Z { 2024-04-03T11:40:48.9112350Z "mediaType": 
"application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-04-03T11:40:48.9113017Z "size": 15983023, 2024-04-03T11:40:48.9113692Z "digest": "sha256:7bbe6355f0c3d70fec00bbf5103b2595bedee4fd79a36b8ee3d23be9f5ffc859" 2024-04-03T11:40:48.9114436Z } 2024-04-03T11:40:48.9114740Z ] 2024-04-03T11:40:48.9115043Z } 2024-04-03T11:40:48.9115378Z + exit 0 2024-04-03T11:40:48.9120886Z ##[debug]Finished: run 2024-04-03T11:40:48.9125498Z ##[debug]Evaluating: github.event.repository.name 2024-04-03T11:40:48.9126113Z ##[debug]Evaluating Index: 2024-04-03T11:40:48.9126630Z ##[debug]..Evaluating Index: 2024-04-03T11:40:48.9127101Z ##[debug]....Evaluating Index: 2024-04-03T11:40:48.9127578Z ##[debug]......Evaluating github: 2024-04-03T11:40:48.9128124Z ##[debug]......=> Object 2024-04-03T11:40:48.9128624Z ##[debug]......Evaluating String: 2024-04-03T11:40:48.9129108Z ##[debug]......=> 'event' 2024-04-03T11:40:48.9129797Z ##[debug]....=> Object 2024-04-03T11:40:48.9130226Z ##[debug]....Evaluating String: 2024-04-03T11:40:48.9130721Z ##[debug]....=> 'repository' 2024-04-03T11:40:48.9131180Z ##[debug]..=> Object 2024-04-03T11:40:48.9131600Z ##[debug]..Evaluating String: 2024-04-03T11:40:48.9132053Z ##[debug]..=> 'name' 2024-04-03T11:40:48.9132523Z ##[debug]=> 'pytorch' 2024-04-03T11:40:48.9132978Z ##[debug]Result: 'pytorch' 2024-04-03T11:40:48.9133610Z ##[debug]Evaluating: inputs.push 2024-04-03T11:40:48.9134103Z ##[debug]Evaluating Index: 2024-04-03T11:40:48.9134566Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:48.9135024Z ##[debug]..=> Object 2024-04-03T11:40:48.9135440Z ##[debug]..Evaluating String: 2024-04-03T11:40:48.9135896Z ##[debug]..=> 'push' 2024-04-03T11:40:48.9136301Z ##[debug]=> '' 2024-04-03T11:40:48.9136682Z ##[debug]Result: '' 2024-04-03T11:40:48.9137243Z ##[debug]Evaluating: inputs.force-push 2024-04-03T11:40:48.9137767Z ##[debug]Evaluating Index: 2024-04-03T11:40:48.9138219Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:48.9138673Z ##[debug]..=> Object 
2024-04-03T11:40:48.9139097Z ##[debug]..Evaluating String: 2024-04-03T11:40:48.9139561Z ##[debug]..=> 'force-push' 2024-04-03T11:40:48.9140166Z ##[debug]=> 'false' 2024-04-03T11:40:48.9140576Z ##[debug]Result: 'false' 2024-04-03T11:40:48.9141340Z ##[debug]Evaluating: steps.calculate-image.outputs.docker-image 2024-04-03T11:40:48.9142024Z ##[debug]Evaluating Index: 2024-04-03T11:40:48.9142476Z ##[debug]..Evaluating Index: 2024-04-03T11:40:48.9142947Z ##[debug]....Evaluating Index: 2024-04-03T11:40:48.9143434Z ##[debug]......Evaluating steps: 2024-04-03T11:40:48.9143915Z ##[debug]......=> Object 2024-04-03T11:40:48.9144556Z ##[debug]......Evaluating String: 2024-04-03T11:40:48.9145077Z ##[debug]......=> 'calculate-image' 2024-04-03T11:40:48.9145589Z ##[debug]....=> Object 2024-04-03T11:40:48.9146020Z ##[debug]....Evaluating String: 2024-04-03T11:40:48.9146492Z ##[debug]....=> 'outputs' 2024-04-03T11:40:48.9146923Z ##[debug]..=> Object 2024-04-03T11:40:48.9147340Z ##[debug]..Evaluating String: 2024-04-03T11:40:48.9147813Z ##[debug]..=> 'docker-image' 2024-04-03T11:40:48.9148917Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:48.9150718Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:48.9152089Z ##[debug]Evaluating: inputs.docker-registry 2024-04-03T11:40:48.9152646Z ##[debug]Evaluating Index: 2024-04-03T11:40:48.9153103Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:48.9153576Z ##[debug]..=> Object 2024-04-03T11:40:48.9153994Z ##[debug]..Evaluating String: 2024-04-03T11:40:48.9154471Z ##[debug]..=> 'docker-registry' 2024-04-03T11:40:48.9155070Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com' 2024-04-03T11:40:48.9155843Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com' 2024-04-03T11:40:48.9156721Z ##[debug]Evaluating 
condition for step: 'run' 2024-04-03T11:40:48.9160061Z ##[debug]Evaluating: (success() && (steps.calculate-image.outputs.skip != 'true') && (inputs.always-rebuild || steps.check-image.outputs.rebuild)) 2024-04-03T11:40:48.9161496Z ##[debug]Evaluating And: 2024-04-03T11:40:48.9162085Z ##[debug]..Evaluating success: 2024-04-03T11:40:48.9162931Z ##[debug]..=> true 2024-04-03T11:40:48.9163554Z ##[debug]..Evaluating NotEqual: 2024-04-03T11:40:48.9164199Z ##[debug]....Evaluating Index: 2024-04-03T11:40:48.9164834Z ##[debug]......Evaluating Index: 2024-04-03T11:40:48.9165466Z ##[debug]........Evaluating Index: 2024-04-03T11:40:48.9166136Z ##[debug]..........Evaluating steps: 2024-04-03T11:40:48.9166784Z ##[debug]..........=> Object 2024-04-03T11:40:48.9167416Z ##[debug]..........Evaluating String: 2024-04-03T11:40:48.9168090Z ##[debug]..........=> 'calculate-image' 2024-04-03T11:40:48.9168747Z ##[debug]........=> Object 2024-04-03T11:40:48.9169361Z ##[debug]........Evaluating String: 2024-04-03T11:40:48.9169988Z ##[debug]........=> 'outputs' 2024-04-03T11:40:48.9170579Z ##[debug]......=> Object 2024-04-03T11:40:48.9171181Z ##[debug]......Evaluating String: 2024-04-03T11:40:48.9171806Z ##[debug]......=> 'skip' 2024-04-03T11:40:48.9172365Z ##[debug]....=> 'false' 2024-04-03T11:40:48.9172952Z ##[debug]....Evaluating String: 2024-04-03T11:40:48.9173561Z ##[debug]....=> 'true' 2024-04-03T11:40:48.9174147Z ##[debug]..=> true 2024-04-03T11:40:48.9174717Z ##[debug]..Evaluating Or: 2024-04-03T11:40:48.9175306Z ##[debug]....Evaluating Index: 2024-04-03T11:40:48.9175940Z ##[debug]......Evaluating inputs: 2024-04-03T11:40:48.9176551Z ##[debug]......=> Object 2024-04-03T11:40:48.9177160Z ##[debug]......Evaluating String: 2024-04-03T11:40:48.9177804Z ##[debug]......=> 'always-rebuild' 2024-04-03T11:40:48.9178431Z ##[debug]....=> '' 2024-04-03T11:40:48.9178982Z ##[debug]....Evaluating Index: 2024-04-03T11:40:48.9179716Z ##[debug]......Evaluating Index: 2024-04-03T11:40:48.9180365Z 
##[debug]........Evaluating Index: 2024-04-03T11:40:48.9181015Z ##[debug]..........Evaluating steps: 2024-04-03T11:40:48.9181665Z ##[debug]..........=> Object 2024-04-03T11:40:48.9182289Z ##[debug]..........Evaluating String: 2024-04-03T11:40:48.9182956Z ##[debug]..........=> 'check-image' 2024-04-03T11:40:48.9183583Z ##[debug]........=> Object 2024-04-03T11:40:48.9184200Z ##[debug]........Evaluating String: 2024-04-03T11:40:48.9184845Z ##[debug]........=> 'outputs' 2024-04-03T11:40:48.9185433Z ##[debug]......=> Object 2024-04-03T11:40:48.9186032Z ##[debug]......Evaluating String: 2024-04-03T11:40:48.9186653Z ##[debug]......=> 'rebuild' 2024-04-03T11:40:48.9187230Z ##[debug]....=> null 2024-04-03T11:40:48.9187889Z ##[debug]..=> null 2024-04-03T11:40:48.9188453Z ##[debug]=> null 2024-04-03T11:40:48.9189562Z ##[debug]Expanded: (true && ('false' != 'true') && ('' || null)) 2024-04-03T11:40:48.9190387Z ##[debug]Result: null 2024-04-03T11:40:48.9245827Z ##[debug]Evaluating: steps.calculate-image.outputs.docker-image 2024-04-03T11:40:48.9246546Z ##[debug]Evaluating Index: 2024-04-03T11:40:48.9247008Z ##[debug]..Evaluating Index: 2024-04-03T11:40:48.9247491Z ##[debug]....Evaluating Index: 2024-04-03T11:40:48.9247969Z ##[debug]......Evaluating steps: 2024-04-03T11:40:48.9248458Z ##[debug]......=> Object 2024-04-03T11:40:48.9248914Z ##[debug]......Evaluating String: 2024-04-03T11:40:48.9249427Z ##[debug]......=> 'calculate-image' 2024-04-03T11:40:48.9249934Z ##[debug]....=> Object 2024-04-03T11:40:48.9250372Z ##[debug]....Evaluating String: 2024-04-03T11:40:48.9250847Z ##[debug]....=> 'outputs' 2024-04-03T11:40:48.9251297Z ##[debug]..=> Object 2024-04-03T11:40:48.9251716Z ##[debug]..Evaluating String: 2024-04-03T11:40:48.9252197Z ##[debug]..=> 'docker-image' 2024-04-03T11:40:48.9253304Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:48.9255109Z ##[debug]Result: 
'308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:48.9279019Z ##[debug]Finishing: Calculate docker image 2024-04-03T11:40:48.9290849Z ##[debug]Evaluating condition for step: 'Pull docker image' 2024-04-03T11:40:48.9292307Z ##[debug]Evaluating: success() 2024-04-03T11:40:48.9292939Z ##[debug]Evaluating success: 2024-04-03T11:40:48.9293643Z ##[debug]=> true 2024-04-03T11:40:48.9294241Z ##[debug]Result: true 2024-04-03T11:40:48.9295070Z ##[debug]Starting: Pull docker image 2024-04-03T11:40:48.9336375Z ##[debug]Loading inputs 2024-04-03T11:40:48.9337809Z ##[debug]Evaluating: steps.calculate-docker-image.outputs.docker-image 2024-04-03T11:40:48.9338559Z ##[debug]Evaluating Index: 2024-04-03T11:40:48.9339019Z ##[debug]..Evaluating Index: 2024-04-03T11:40:48.9339495Z ##[debug]....Evaluating Index: 2024-04-03T11:40:48.9340046Z ##[debug]......Evaluating steps: 2024-04-03T11:40:48.9340550Z ##[debug]......=> Object 2024-04-03T11:40:48.9341007Z ##[debug]......Evaluating String: 2024-04-03T11:40:48.9341537Z ##[debug]......=> 'calculate-docker-image' 2024-04-03T11:40:48.9342090Z ##[debug]....=> Object 2024-04-03T11:40:48.9342617Z ##[debug]....Evaluating String: 2024-04-03T11:40:48.9343095Z ##[debug]....=> 'outputs' 2024-04-03T11:40:48.9343535Z ##[debug]..=> Object 2024-04-03T11:40:48.9343958Z ##[debug]..Evaluating String: 2024-04-03T11:40:48.9344428Z ##[debug]..=> 'docker-image' 2024-04-03T11:40:48.9345523Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:48.9347330Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:48.9349696Z ##[debug]Loading env 2024-04-03T11:40:48.9356427Z ##[group]Run pytorch/test-infra/.github/actions/pull-docker-image@main 
2024-04-03T11:40:48.9357100Z with: 2024-04-03T11:40:48.9358100Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:48.9359421Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:48.9360052Z env: 2024-04-03T11:40:48.9360398Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:48.9360909Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:48.9361826Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:48.9362704Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:40:48.9363176Z AWS_REGION: us-east-1 2024-04-03T11:40:48.9363842Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:40:48.9364460Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:40:48.9372441Z AWS_SESSION_TOKEN: *** 2024-04-03T11:40:48.9372880Z ##[endgroup] 2024-04-03T11:40:48.9375636Z ##[debug]Evaluating: inputs.docker-image 2024-04-03T11:40:48.9376181Z ##[debug]Evaluating Index: 2024-04-03T11:40:48.9376640Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:48.9377118Z ##[debug]..=> Object 2024-04-03T11:40:48.9377539Z ##[debug]..Evaluating String: 2024-04-03T11:40:48.9378014Z ##[debug]..=> 'docker-image' 2024-04-03T11:40:48.9379109Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:48.9380974Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:40:48.9382319Z ##[debug]Evaluating: inputs.docker-registry 2024-04-03T11:40:48.9382875Z ##[debug]Evaluating Index: 2024-04-03T11:40:48.9383329Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:48.9383799Z ##[debug]..=> Object 2024-04-03T11:40:48.9384227Z ##[debug]..Evaluating String: 2024-04-03T11:40:48.9384715Z ##[debug]..=> 'docker-registry' 2024-04-03T11:40:48.9385319Z ##[debug]=> 
'308535385114.dkr.ecr.us-east-1.amazonaws.com' 2024-04-03T11:40:48.9386077Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com' 2024-04-03T11:40:48.9386839Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:48.9387961Z ##[debug]Evaluating: success() 2024-04-03T11:40:48.9388901Z ##[debug]Evaluating success: 2024-04-03T11:40:48.9389523Z ##[debug]=> true 2024-04-03T11:40:48.9390128Z ##[debug]Result: true 2024-04-03T11:40:48.9390856Z ##[debug]Starting: run 2024-04-03T11:40:48.9400459Z ##[debug]Loading inputs 2024-04-03T11:40:48.9401479Z ##[debug]Loading env 2024-04-03T11:40:48.9407720Z ##[group]Run set -x 2024-04-03T11:40:48.9408136Z set -x 2024-04-03T11:40:48.9408499Z set +e 2024-04-03T11:40:48.9408861Z  2024-04-03T11:40:48.9409209Z login() { 2024-04-03T11:40:48.9410002Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2024-04-03T11:40:48.9410891Z } 2024-04-03T11:40:48.9411234Z  2024-04-03T11:40:48.9411572Z retry () { 2024-04-03T11:40:48.9412042Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2024-04-03T11:40:48.9412586Z } 2024-04-03T11:40:48.9412932Z  2024-04-03T11:40:48.9413325Z retry login "${DOCKER_REGISTRY}" 2024-04-03T11:40:48.9413854Z  2024-04-03T11:40:48.9414196Z set -e 2024-04-03T11:40:48.9414771Z # ignore output since only exit code is used for conditional 2024-04-03T11:40:48.9415632Z # only pull docker image if it's not available locally 2024-04-03T11:40:48.9416562Z if ! 
docker inspect --type=image "${DOCKER_IMAGE}" >/dev/null 2>/dev/null; then 2024-04-03T11:40:48.9417415Z  retry docker pull "${DOCKER_IMAGE}" 2024-04-03T11:40:48.9417955Z fi 2024-04-03T11:40:48.9450774Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:48.9451382Z env: 2024-04-03T11:40:48.9451721Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:48.9452223Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:48.9453152Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:48.9454033Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:40:48.9454504Z AWS_REGION: us-east-1 2024-04-03T11:40:48.9455026Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:40:48.9455633Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:40:48.9463705Z AWS_SESSION_TOKEN: *** 2024-04-03T11:40:48.9464796Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:48.9466077Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:48.9466701Z ##[endgroup] 2024-04-03T11:40:48.9494038Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/8e621b59-35fa-471c-8cc1-73813cc36af8.sh 2024-04-03T11:40:48.9528089Z + set +e 2024-04-03T11:40:48.9529938Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:48.9531976Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:48.9534478Z + aws ecr get-login-password --region us-east-1 2024-04-03T11:40:48.9538241Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T11:40:50.3447804Z WARNING! Your password will be stored unencrypted in /home/pytorchci/.docker/config.json. 2024-04-03T11:40:50.3450443Z Configure a credential helper to remove this warning. 
See 2024-04-03T11:40:50.3453284Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2024-04-03T11:40:50.3454964Z 2024-04-03T11:40:50.3455342Z Login Succeeded 2024-04-03T11:40:50.3497133Z + set -e 2024-04-03T11:40:50.3500613Z + docker inspect --type=image 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:40:50.3873924Z ##[debug]Finished: run 2024-04-03T11:40:50.3884781Z ##[debug]Finishing: Pull docker image 2024-04-03T11:40:50.3912267Z ##[debug]Evaluating condition for step: 'Start monitoring script' 2024-04-03T11:40:50.3916135Z ##[debug]Evaluating: success() 2024-04-03T11:40:50.3917775Z ##[debug]Evaluating success: 2024-04-03T11:40:50.3919436Z ##[debug]=> true 2024-04-03T11:40:50.3921442Z ##[debug]Result: true 2024-04-03T11:40:50.3923401Z ##[debug]Starting: Start monitoring script 2024-04-03T11:40:50.3950186Z ##[debug]Loading inputs 2024-04-03T11:40:50.3952984Z ##[debug]Loading env 2024-04-03T11:40:50.3979583Z ##[group]Run python3 -m pip install psutil==5.9.1 nvidia-ml-py==11.525.84 2024-04-03T11:40:50.3982166Z python3 -m pip install psutil==5.9.1 nvidia-ml-py==11.525.84 2024-04-03T11:40:50.3984425Z python3 -m tools.stats.monitor > usage_log.txt 2>&1 & 2024-04-03T11:40:50.3986559Z echo "monitor-script-pid=${!}" >> "${GITHUB_OUTPUT}" 2024-04-03T11:40:50.4044210Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:40:50.4044818Z env: 2024-04-03T11:40:50.4045215Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:50.4045727Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:50.4046646Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:50.4047532Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:40:50.4048029Z AWS_REGION: us-east-1 2024-04-03T11:40:50.4048575Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:40:50.4049310Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:40:50.4057464Z 
AWS_SESSION_TOKEN: *** 2024-04-03T11:40:50.4057887Z ##[endgroup] 2024-04-03T11:40:50.4087158Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/8a8aa7d1-86a7-4110-bc6d-aefb62e8dc93.sh 2024-04-03T11:40:50.7673300Z Defaulting to user installation because normal site-packages is not writeable 2024-04-03T11:40:50.8791376Z Requirement already satisfied: psutil==5.9.1 in /home/pytorchci/.local/lib/python3.10/site-packages (5.9.1) 2024-04-03T11:40:50.8799350Z Requirement already satisfied: nvidia-ml-py==11.525.84 in /home/pytorchci/.local/lib/python3.10/site-packages (11.525.84) 2024-04-03T11:40:51.1305683Z ##[debug]Set output monitor-script-pid = 3036295 2024-04-03T11:40:51.1308214Z ##[debug]Finishing: Start monitoring script 2024-04-03T11:40:51.1331443Z ##[debug]Evaluating condition for step: 'Download build artifacts' 2024-04-03T11:40:51.1334956Z ##[debug]Evaluating: success() 2024-04-03T11:40:51.1336470Z ##[debug]Evaluating success: 2024-04-03T11:40:51.1337995Z ##[debug]=> true 2024-04-03T11:40:51.1339407Z ##[debug]Result: true 2024-04-03T11:40:51.1341487Z ##[debug]Starting: Download build artifacts 2024-04-03T11:40:51.1417520Z Prepare all required actions 2024-04-03T11:40:51.1418131Z Getting action download info 2024-04-03T11:40:51.3086803Z Download action repository 'seemethere/download-artifact-s3@v4' (SHA:1da556a7aa0a088e3153970611f6c432d58e80e6) 2024-04-03T11:40:52.3988825Z ##[debug]Download 'https://api.github.com/repos/seemethere/download-artifact-s3/tarball/1da556a7aa0a088e3153970611f6c432d58e80e6' to '/home/pytorchci/actions-runner/_work/_actions/_temp_fe4038dc-d5ff-40a1-9bdd-82d2874d81c6/a49fa7ee-3632-4b70-b5e2-93c8f2394764.tar.gz' 2024-04-03T11:40:52.4646878Z ##[debug]Unwrap 'seemethere-download-artifact-s3-1da556a' to '/home/pytorchci/actions-runner/_work/_actions/seemethere/download-artifact-s3/v4' 2024-04-03T11:40:52.4826750Z ##[debug]Archive 
'/home/pytorchci/actions-runner/_work/_actions/_temp_fe4038dc-d5ff-40a1-9bdd-82d2874d81c6/a49fa7ee-3632-4b70-b5e2-93c8f2394764.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/seemethere/download-artifact-s3/v4'. 2024-04-03T11:40:52.4908015Z Download action repository 'actions/download-artifact@v3' (SHA:9bc31d5ccc31df68ecc42ccf4149144866c47d8a) 2024-04-03T11:40:53.2729080Z ##[debug]Download 'https://api.github.com/repos/actions/download-artifact/tarball/9bc31d5ccc31df68ecc42ccf4149144866c47d8a' to '/home/pytorchci/actions-runner/_work/_actions/_temp_fc862edd-dbe9-49c2-bbcc-b51b58229ba6/a2190fe8-d768-4f0a-a975-5649013763dc.tar.gz' 2024-04-03T11:40:53.2945581Z ##[debug]Unwrap 'actions-download-artifact-9bc31d5' to '/home/pytorchci/actions-runner/_work/_actions/actions/download-artifact/v3' 2024-04-03T11:40:53.3035532Z ##[debug]Archive '/home/pytorchci/actions-runner/_work/_actions/_temp_fc862edd-dbe9-49c2-bbcc-b51b58229ba6/a2190fe8-d768-4f0a-a975-5649013763dc.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/actions/download-artifact/v3'. 2024-04-03T11:40:53.3087684Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/seemethere/download-artifact-s3/v4/action.yml'. 2024-04-03T11:40:53.3126705Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/actions/download-artifact/v3/action.yml'. 
2024-04-03T11:40:53.3247082Z ##[debug]Loading inputs 2024-04-03T11:40:53.3248441Z ##[debug]Evaluating: inputs.build-environment 2024-04-03T11:40:53.3249023Z ##[debug]Evaluating Index: 2024-04-03T11:40:53.3249496Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:53.3250008Z ##[debug]..=> Object 2024-04-03T11:40:53.3250451Z ##[debug]..Evaluating String: 2024-04-03T11:40:53.3250943Z ##[debug]..=> 'build-environment' 2024-04-03T11:40:53.3251517Z ##[debug]=> 'linux-focal-rocm6.0-py3.8' 2024-04-03T11:40:53.3252142Z ##[debug]Result: 'linux-focal-rocm6.0-py3.8' 2024-04-03T11:40:53.3254196Z ##[debug]Loading env 2024-04-03T11:40:53.3261130Z ##[group]Run ./.github/actions/download-build-artifacts 2024-04-03T11:40:53.3261820Z with: 2024-04-03T11:40:53.3262256Z name: linux-focal-rocm6.0-py3.8 2024-04-03T11:40:53.3262848Z s3-bucket: gha-artifacts 2024-04-03T11:40:53.3263346Z env: 2024-04-03T11:40:53.3263748Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:53.3264357Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:53.3265474Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:53.3266526Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:40:53.3267098Z AWS_REGION: us-east-1 2024-04-03T11:40:53.3267840Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:40:53.3268593Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:40:53.3278265Z AWS_SESSION_TOKEN: *** 2024-04-03T11:40:53.3278772Z ##[endgroup] 2024-04-03T11:40:53.3283651Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:40:53.3285199Z ##[debug]Evaluating: (success() && !inputs.use-gha) 2024-04-03T11:40:53.3285949Z ##[debug]Evaluating And: 2024-04-03T11:40:53.3286558Z ##[debug]..Evaluating success: 2024-04-03T11:40:53.3287205Z ##[debug]..=> true 2024-04-03T11:40:53.3287772Z ##[debug]..Evaluating Not: 2024-04-03T11:40:53.3288398Z ##[debug]....Evaluating Index: 2024-04-03T11:40:53.3289035Z ##[debug]......Evaluating inputs: 2024-04-03T11:40:53.3289653Z ##[debug]......=> Object 
2024-04-03T11:40:53.3290257Z ##[debug]......Evaluating String: 2024-04-03T11:40:53.3290867Z ##[debug]......=> 'use-gha' 2024-04-03T11:40:53.3291453Z ##[debug]....=> '' 2024-04-03T11:40:53.3291964Z ##[debug]..=> true 2024-04-03T11:40:53.3292526Z ##[debug]=> true 2024-04-03T11:40:53.3293244Z ##[debug]Expanded: (true && !'') 2024-04-03T11:40:53.3293957Z ##[debug]Result: true 2024-04-03T11:40:53.3294698Z ##[debug]Starting: run 2024-04-03T11:40:53.3320169Z ##[debug]Loading inputs 2024-04-03T11:40:53.3321460Z ##[debug]Evaluating: inputs.name 2024-04-03T11:40:53.3321978Z ##[debug]Evaluating Index: 2024-04-03T11:40:53.3322437Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:53.3322902Z ##[debug]..=> Object 2024-04-03T11:40:53.3323329Z ##[debug]..Evaluating String: 2024-04-03T11:40:53.3323790Z ##[debug]..=> 'name' 2024-04-03T11:40:53.3324269Z ##[debug]=> 'linux-focal-rocm6.0-py3.8' 2024-04-03T11:40:53.3324874Z ##[debug]Result: 'linux-focal-rocm6.0-py3.8' 2024-04-03T11:40:53.3325823Z ##[debug]Evaluating: inputs.s3-bucket 2024-04-03T11:40:53.3326404Z ##[debug]Evaluating Index: 2024-04-03T11:40:53.3326926Z ##[debug]..Evaluating inputs: 2024-04-03T11:40:53.3327437Z ##[debug]..=> Object 2024-04-03T11:40:53.3327903Z ##[debug]..Evaluating String: 2024-04-03T11:40:53.3328405Z ##[debug]..=> 's3-bucket' 2024-04-03T11:40:53.3328896Z ##[debug]=> 'gha-artifacts' 2024-04-03T11:40:53.3329431Z ##[debug]Result: 'gha-artifacts' 2024-04-03T11:40:53.3331514Z ##[debug]Loading env 2024-04-03T11:40:53.3338577Z ##[group]Run seemethere/download-artifact-s3@v4 2024-04-03T11:40:53.3339319Z with: 2024-04-03T11:40:53.3339832Z name: linux-focal-rocm6.0-py3.8 2024-04-03T11:40:53.3340329Z s3-bucket: gha-artifacts 2024-04-03T11:40:53.3340851Z region: us-east-1 2024-04-03T11:40:53.3341281Z env: 2024-04-03T11:40:53.3341678Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:40:53.3342272Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:40:53.3343386Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd 
--device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:40:53.3344442Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:40:53.3345006Z AWS_REGION: us-east-1 2024-04-03T11:40:53.3345607Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:40:53.3346340Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:40:53.3354969Z AWS_SESSION_TOKEN: *** 2024-04-03T11:40:53.3355397Z ##[endgroup] 2024-04-03T11:40:53.8314933Z ##[debug]Resolved path is /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:40:53.8387492Z ##[debug]{"Bucket":"gha-artifacts","Prefix":"pytorch/pytorch/8536952077/linux-focal-rocm6.0-py3.8/"} 2024-04-03T11:40:53.8627572Z (node:3036341) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2024-04-03T11:40:53.8629876Z 2024-04-03T11:40:53.8630696Z Please migrate your code to use AWS SDK for JavaScript (v3). 2024-04-03T11:40:53.8633008Z For more information, check the migration guide at https://a.co/7PzMCcy 2024-04-03T11:40:53.8635724Z (Use `node --trace-warnings ...` to show where the warning was created) 2024-04-03T11:40:54.2337045Z Found 1 objects with prefix pytorch/pytorch/8536952077/linux-focal-rocm6.0-py3.8/ 2024-04-03T11:40:54.2340484Z Starting download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/artifacts.zip 2024-04-03T11:40:54.2344847Z ##[debug]S3 download uri: s3://gha-artifacts/pytorch/pytorch/8536952077/linux-focal-rocm6.0-py3.8/artifacts.zip 2024-04-03T11:41:11.8114305Z Finished download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/artifacts.zip 2024-04-03T11:41:11.8131270Z Artifact download has finished successfully 2024-04-03T11:41:11.8398289Z ##[debug]Node Action run completed with exit code 0 2024-04-03T11:41:11.8406173Z ##[debug]Set output download-path = /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:41:11.8408985Z ##[debug]Finished: run 2024-04-03T11:41:11.8418196Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:11.8422801Z 
##[debug]Evaluating: (success() && inputs.use-gha) 2024-04-03T11:41:11.8424766Z ##[debug]Evaluating And: 2024-04-03T11:41:11.8426330Z ##[debug]..Evaluating success: 2024-04-03T11:41:11.8428007Z ##[debug]..=> true 2024-04-03T11:41:11.8429452Z ##[debug]..Evaluating Index: 2024-04-03T11:41:11.8431065Z ##[debug]....Evaluating inputs: 2024-04-03T11:41:11.8432661Z ##[debug]....=> Object 2024-04-03T11:41:11.8434159Z ##[debug]....Evaluating String: 2024-04-03T11:41:11.8435771Z ##[debug]....=> 'use-gha' 2024-04-03T11:41:11.8437254Z ##[debug]..=> '' 2024-04-03T11:41:11.8438582Z ##[debug]=> '' 2024-04-03T11:41:11.8440287Z ##[debug]Expanded: (true && '') 2024-04-03T11:41:11.8442043Z ##[debug]Result: '' 2024-04-03T11:41:11.8447991Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:11.8450938Z ##[debug]Evaluating: success() 2024-04-03T11:41:11.8452593Z ##[debug]Evaluating success: 2024-04-03T11:41:11.8454195Z ##[debug]=> true 2024-04-03T11:41:11.8455748Z ##[debug]Result: true 2024-04-03T11:41:11.8457470Z ##[debug]Starting: run 2024-04-03T11:41:11.8484192Z ##[debug]Loading inputs 2024-04-03T11:41:11.8486960Z ##[debug]Loading env 2024-04-03T11:41:11.8495730Z ##[group]Run unzip -o artifacts.zip 2024-04-03T11:41:11.8496255Z unzip -o artifacts.zip 2024-04-03T11:41:11.8566466Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:41:11.8567653Z env: 2024-04-03T11:41:11.8568300Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:11.8569257Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:11.8571081Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:11.8573239Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:11.8574130Z AWS_REGION: us-east-1 2024-04-03T11:41:11.8575121Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:11.8576285Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:11.8592320Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:11.8593106Z ##[endgroup] 2024-04-03T11:41:11.8646449Z 
##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/6b1fa77c-e772-4dfd-9aea-b504783e0b2d.sh 2024-04-03T11:41:11.8694527Z Archive: artifacts.zip 2024-04-03T11:41:11.8695680Z creating: dist/ 2024-04-03T11:41:13.9632297Z inflating: dist/torch-2.4.0a0+gitd253bd7-cp38-cp38-linux_x86_64.whl 2024-04-03T11:41:13.9782095Z inflating: dist/.ninja_log 2024-04-03T11:41:13.9783572Z creating: build/custom_test_artifacts/ 2024-04-03T11:41:13.9785519Z creating: build/custom_test_artifacts/custom-op-build/ 2024-04-03T11:41:13.9787830Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/ 2024-04-03T11:41:13.9790553Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/pkgRedirects/ 2024-04-03T11:41:13.9793699Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeConfigureLog.yaml 2024-04-03T11:41:13.9796721Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/ 2024-04-03T11:41:13.9799756Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeSystem.cmake 2024-04-03T11:41:13.9802953Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/ 2024-04-03T11:41:13.9806069Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/tmp/ 2024-04-03T11:41:13.9809605Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/CMakeCCompilerId.c 2024-04-03T11:41:13.9813245Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/a.out 2024-04-03T11:41:13.9817043Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/ 2024-04-03T11:41:13.9820506Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/tmp/ 2024-04-03T11:41:13.9824235Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/CMakeCXXCompilerId.cpp 2024-04-03T11:41:13.9827985Z inflating: 
build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/a.out 2024-04-03T11:41:13.9831656Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_C.bin 2024-04-03T11:41:13.9835329Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeCCompiler.cmake 2024-04-03T11:41:13.9839072Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CXX.bin 2024-04-03T11:41:13.9842802Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeCXXCompiler.cmake 2024-04-03T11:41:13.9845973Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeScratch/ 2024-04-03T11:41:13.9848784Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeTmp/ 2024-04-03T11:41:13.9851692Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/cmake.check_cache 2024-04-03T11:41:13.9854665Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/ 2024-04-03T11:41:13.9857972Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.ts 2024-04-03T11:41:13.9862073Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.make 2024-04-03T11:41:13.9865647Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/depend.make 2024-04-03T11:41:13.9868987Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/link.txt 2024-04-03T11:41:13.9871915Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/cmake_clean.cmake 2024-04-03T11:41:13.9873449Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/build.make 2024-04-03T11:41:13.9874763Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/DependInfo.cmake 2024-04-03T11:41:13.9876067Z inflating: 
build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/flags.make 2024-04-03T11:41:13.9877352Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/progress.make 2024-04-03T11:41:13.9878633Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o.d 2024-04-03T11:41:13.9979731Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o 2024-04-03T11:41:13.9981210Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/ 2024-04-03T11:41:13.9982799Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.ts 2024-04-03T11:41:13.9984571Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.make 2024-04-03T11:41:13.9986262Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/depend.make 2024-04-03T11:41:13.9987847Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/link.txt 2024-04-03T11:41:13.9989477Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/cmake_clean.cmake 2024-04-03T11:41:13.9991712Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/build.make 2024-04-03T11:41:13.9995823Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/DependInfo.cmake 2024-04-03T11:41:13.9999900Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/flags.make 2024-04-03T11:41:14.0004153Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/progress.make 2024-04-03T11:41:14.0009187Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o.d 2024-04-03T11:41:14.0098201Z inflating: 
build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o 2024-04-03T11:41:14.0102406Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeDirectoryInformation.cmake 2024-04-03T11:41:14.0105889Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/TargetDirectories.txt 2024-04-03T11:41:14.0109046Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/progress.marks 2024-04-03T11:41:14.0111962Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile2 2024-04-03T11:41:14.0114094Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile.cmake 2024-04-03T11:41:14.0116211Z inflating: build/custom_test_artifacts/custom-op-build/detect_rocm_version.cc 2024-04-03T11:41:14.0118347Z inflating: build/custom_test_artifacts/custom-op-build/hipblaslt_test_data_type.cc 2024-04-03T11:41:14.0120562Z inflating: build/custom_test_artifacts/custom-op-build/hipblaslt_test_compute_type.cc 2024-04-03T11:41:14.0122890Z inflating: build/custom_test_artifacts/custom-op-build/hipblaslt_test_getIndexFromAlgo.cc 2024-04-03T11:41:14.0125035Z inflating: build/custom_test_artifacts/custom-op-build/hip_new_types.cc 2024-04-03T11:41:14.0126953Z inflating: build/custom_test_artifacts/custom-op-build/CMakeCache.txt 2024-04-03T11:41:14.0128753Z inflating: build/custom_test_artifacts/custom-op-build/Makefile 2024-04-03T11:41:14.0130622Z inflating: build/custom_test_artifacts/custom-op-build/cmake_install.cmake 2024-04-03T11:41:14.0223119Z inflating: build/custom_test_artifacts/custom-op-build/libcustom_ops.so 2024-04-03T11:41:14.0289251Z inflating: build/custom_test_artifacts/custom-op-build/test_custom_ops 2024-04-03T11:41:14.0291010Z creating: build/custom_test_artifacts/jit-hook-build/ 2024-04-03T11:41:14.0292950Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/ 2024-04-03T11:41:14.0294860Z creating: 
build/custom_test_artifacts/jit-hook-build/CMakeFiles/pkgRedirects/ 2024-04-03T11:41:14.0297079Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeConfigureLog.yaml 2024-04-03T11:41:14.0299216Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/ 2024-04-03T11:41:14.0301519Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeSystem.cmake 2024-04-03T11:41:14.0303795Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/ 2024-04-03T11:41:14.0306005Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/tmp/ 2024-04-03T11:41:14.0308544Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/CMakeCCompilerId.c 2024-04-03T11:41:14.0311121Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/a.out 2024-04-03T11:41:14.0313414Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/ 2024-04-03T11:41:14.0315689Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/tmp/ 2024-04-03T11:41:14.0318325Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/CMakeCXXCompilerId.cpp 2024-04-03T11:41:14.0320987Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/a.out 2024-04-03T11:41:14.0323584Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_C.bin 2024-04-03T11:41:14.0326196Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeCCompiler.cmake 2024-04-03T11:41:14.0328883Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CXX.bin 2024-04-03T11:41:14.0331766Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeCXXCompiler.cmake 2024-04-03T11:41:14.0334049Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeScratch/ 
2024-04-03T11:41:14.0336038Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeTmp/ 2024-04-03T11:41:14.0338096Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/cmake.check_cache 2024-04-03T11:41:14.0340512Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/ 2024-04-03T11:41:14.0342989Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.ts 2024-04-03T11:41:14.0345750Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.make 2024-04-03T11:41:14.0348388Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/depend.make 2024-04-03T11:41:14.0350898Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/link.txt 2024-04-03T11:41:14.0353472Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/cmake_clean.cmake 2024-04-03T11:41:14.0356040Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/build.make 2024-04-03T11:41:14.0358625Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/DependInfo.cmake 2024-04-03T11:41:14.0361188Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/flags.make 2024-04-03T11:41:14.0363736Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/progress.make 2024-04-03T11:41:14.0366472Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o.d 2024-04-03T11:41:14.0405931Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o 2024-04-03T11:41:14.0408741Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeDirectoryInformation.cmake 2024-04-03T11:41:14.0411554Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/TargetDirectories.txt 
2024-04-03T11:41:14.0413842Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/progress.marks 2024-04-03T11:41:14.0415936Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile2 2024-04-03T11:41:14.0417998Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile.cmake 2024-04-03T11:41:14.0420358Z inflating: build/custom_test_artifacts/jit-hook-build/detect_rocm_version.cc 2024-04-03T11:41:14.0422487Z inflating: build/custom_test_artifacts/jit-hook-build/hipblaslt_test_data_type.cc 2024-04-03T11:41:14.0424709Z inflating: build/custom_test_artifacts/jit-hook-build/hipblaslt_test_compute_type.cc 2024-04-03T11:41:14.0427024Z inflating: build/custom_test_artifacts/jit-hook-build/hipblaslt_test_getIndexFromAlgo.cc 2024-04-03T11:41:14.0429192Z inflating: build/custom_test_artifacts/jit-hook-build/hip_new_types.cc 2024-04-03T11:41:14.0431078Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeCache.txt 2024-04-03T11:41:14.0432866Z inflating: build/custom_test_artifacts/jit-hook-build/Makefile 2024-04-03T11:41:14.0434700Z inflating: build/custom_test_artifacts/jit-hook-build/cmake_install.cmake 2024-04-03T11:41:14.0468203Z inflating: build/custom_test_artifacts/jit-hook-build/test_jit_hooks 2024-04-03T11:41:14.0470022Z creating: build/custom_test_artifacts/custom-backend-build/ 2024-04-03T11:41:14.0471785Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/ 2024-04-03T11:41:14.0473853Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/pkgRedirects/ 2024-04-03T11:41:14.0476279Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeConfigureLog.yaml 2024-04-03T11:41:14.0479124Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/ 2024-04-03T11:41:14.0481785Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeSystem.cmake 2024-04-03T11:41:14.0484376Z creating: 
build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/ 2024-04-03T11:41:14.0486931Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/tmp/ 2024-04-03T11:41:14.0489945Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/CMakeCCompilerId.c 2024-04-03T11:41:14.0492849Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/a.out 2024-04-03T11:41:14.0495472Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/ 2024-04-03T11:41:14.0498047Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/tmp/ 2024-04-03T11:41:14.0501329Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/CMakeCXXCompilerId.cpp 2024-04-03T11:41:14.0504312Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/a.out 2024-04-03T11:41:14.0507296Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_C.bin 2024-04-03T11:41:14.0510236Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeCCompiler.cmake 2024-04-03T11:41:14.0513181Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CXX.bin 2024-04-03T11:41:14.0516123Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeCXXCompiler.cmake 2024-04-03T11:41:14.0518706Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeScratch/ 2024-04-03T11:41:14.0520988Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeTmp/ 2024-04-03T11:41:14.0523427Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/cmake.check_cache 2024-04-03T11:41:14.0526173Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/ 
2024-04-03T11:41:14.0528910Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.ts 2024-04-03T11:41:14.0532058Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.make 2024-04-03T11:41:14.0535089Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/depend.make 2024-04-03T11:41:14.0537931Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/link.txt 2024-04-03T11:41:14.0540887Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/cmake_clean.cmake 2024-04-03T11:41:14.0543754Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/build.make 2024-04-03T11:41:14.0546734Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/DependInfo.cmake 2024-04-03T11:41:14.0549597Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/flags.make 2024-04-03T11:41:14.0552450Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/progress.make 2024-04-03T11:41:14.0555551Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o.d 2024-04-03T11:41:14.0630035Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o 2024-04-03T11:41:14.0633141Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/ 2024-04-03T11:41:14.0636109Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.ts 2024-04-03T11:41:14.0639774Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.make 2024-04-03T11:41:14.0642927Z inflating: 
build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/depend.make 2024-04-03T11:41:14.0645873Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/link.txt 2024-04-03T11:41:14.0648928Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/cmake_clean.cmake 2024-04-03T11:41:14.0651954Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/build.make 2024-04-03T11:41:14.0655018Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/DependInfo.cmake 2024-04-03T11:41:14.0658101Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/flags.make 2024-04-03T11:41:14.0661472Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/progress.make 2024-04-03T11:41:14.0664753Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o.d 2024-04-03T11:41:14.0717423Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o 2024-04-03T11:41:14.0720754Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeDirectoryInformation.cmake 2024-04-03T11:41:14.0723553Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/TargetDirectories.txt 2024-04-03T11:41:14.0726089Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/progress.marks 2024-04-03T11:41:14.0728508Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile2 2024-04-03T11:41:14.0730871Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile.cmake 2024-04-03T11:41:14.0733238Z inflating: build/custom_test_artifacts/custom-backend-build/detect_rocm_version.cc 2024-04-03T11:41:14.0735988Z inflating: 
build/custom_test_artifacts/custom-backend-build/hipblaslt_test_data_type.cc 2024-04-03T11:41:14.0738498Z inflating: build/custom_test_artifacts/custom-backend-build/hipblaslt_test_compute_type.cc 2024-04-03T11:41:14.0741492Z inflating: build/custom_test_artifacts/custom-backend-build/hipblaslt_test_getIndexFromAlgo.cc 2024-04-03T11:41:14.0743998Z inflating: build/custom_test_artifacts/custom-backend-build/hip_new_types.cc 2024-04-03T11:41:14.0746129Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeCache.txt 2024-04-03T11:41:14.0748249Z inflating: build/custom_test_artifacts/custom-backend-build/Makefile 2024-04-03T11:41:14.0750339Z inflating: build/custom_test_artifacts/custom-backend-build/cmake_install.cmake 2024-04-03T11:41:14.0835358Z inflating: build/custom_test_artifacts/custom-backend-build/libcustom_backend.so 2024-04-03T11:41:14.0882895Z inflating: build/custom_test_artifacts/custom-backend-build/test_custom_backend 2024-04-03T11:41:14.0885169Z creating: build/lib/ 2024-04-03T11:41:14.0978437Z inflating: build/lib/libprotobuf-lite.a 2024-04-03T11:41:14.1456082Z inflating: build/lib/libprotobuf.a 2024-04-03T11:41:14.1982323Z inflating: build/lib/libprotoc.a 2024-04-03T11:41:14.2052415Z inflating: build/lib/libpthreadpool.a 2024-04-03T11:41:14.2054145Z inflating: build/lib/libcpuinfo.a 2024-04-03T11:41:14.2055701Z inflating: build/lib/libcpuinfo_internals.a 2024-04-03T11:41:14.2057238Z inflating: build/lib/libclog.a 2024-04-03T11:41:14.2058638Z inflating: build/lib/libqnnpack.a 2024-04-03T11:41:14.2060299Z inflating: build/lib/libpytorch_qnnpack.a 2024-04-03T11:41:14.2062006Z inflating: build/lib/libnnpack_reference_layers.a 2024-04-03T11:41:14.2066030Z inflating: build/lib/libnnpack.a 2024-04-03T11:41:14.2136022Z inflating: build/lib/libgtest.a 2024-04-03T11:41:14.2158482Z inflating: build/lib/libgmock.a 2024-04-03T11:41:14.2160009Z inflating: build/lib/libgmock_main.a 2024-04-03T11:41:14.2161465Z inflating: build/lib/libgtest_main.a 
2024-04-03T11:41:14.2241501Z inflating: build/lib/libbenchmark.a 2024-04-03T11:41:14.2243072Z inflating: build/lib/libbenchmark_main.a 2024-04-03T11:41:14.2435627Z inflating: build/lib/libXNNPACK.a 2024-04-03T11:41:14.2443513Z inflating: build/lib/libittnotify.a 2024-04-03T11:41:14.2512694Z inflating: build/lib/libasmjit.a 2024-04-03T11:41:14.3824944Z inflating: build/lib/libfbgemm.a 2024-04-03T11:41:14.3855668Z inflating: build/lib/libtensorpipe_uv.a 2024-04-03T11:41:14.4456600Z inflating: build/lib/libtensorpipe.a 2024-04-03T11:41:14.4458064Z inflating: build/lib/libfoxi_loader.a 2024-04-03T11:41:14.4582429Z inflating: build/lib/libgloo.a 2024-04-03T11:41:14.4988644Z inflating: build/lib/libgloo_hip.a 2024-04-03T11:41:14.5033611Z inflating: build/lib/libonnx_proto.a 2024-04-03T11:41:14.5743588Z inflating: build/lib/libonnx.a 2024-04-03T11:41:15.5740287Z inflating: build/lib/libdnnl.a 2024-04-03T11:41:15.5764106Z inflating: build/lib/libfmt.a 2024-04-03T11:41:15.6101123Z inflating: build/lib/libkineto.a 2024-04-03T11:41:15.6195547Z inflating: build/lib/libc10.so 2024-04-03T11:41:15.6197150Z inflating: build/lib/libtorch_global_deps.so 2024-04-03T11:41:15.6244198Z inflating: build/lib/libc10_hip.so 2024-04-03T11:41:15.6245756Z inflating: build/lib/libcaffe2_nvrtc.so 2024-04-03T11:41:15.6288934Z inflating: build/lib/libcaffe2_protos.a 2024-04-03T11:41:18.3948762Z inflating: build/lib/libtorch_cpu.so 2024-04-03T11:41:18.3953165Z inflating: build/lib/libunbox_lib.a 2024-04-03T11:41:18.3958110Z inflating: build/lib/libshm.so 2024-04-03T11:41:22.0730237Z inflating: build/lib/libtorch_hip.so 2024-04-03T11:41:22.0731757Z inflating: build/lib/libtorch.so 2024-04-03T11:41:22.0755477Z inflating: build/lib/libjitbackend_test.so 2024-04-03T11:41:22.0783268Z inflating: build/lib/libbackend_with_compiler.so 2024-04-03T11:41:22.0844730Z inflating: build/lib/libtorchbind_test.so 2024-04-03T11:41:22.2798793Z inflating: build/lib/libtorch_python.so 2024-04-03T11:41:22.2835968Z 
inflating: build/lib/libnnapi_backend.so 2024-04-03T11:41:22.2837962Z creating: build/bin/ 2024-04-03T11:41:22.2839171Z creating: build/bin/CMakeFiles/ 2024-04-03T11:41:22.2894406Z inflating: build/bin/memory_overlapping_test 2024-04-03T11:41:22.2896113Z inflating: build/bin/cmake_install.cmake 2024-04-03T11:41:22.2952309Z inflating: build/bin/mobile_memory_cleanup 2024-04-03T11:41:22.2953973Z inflating: build/bin/CTestTestfile.cmake 2024-04-03T11:41:22.3422288Z inflating: build/bin/protoc-3.13.0.0 2024-04-03T11:41:22.3892267Z inflating: build/bin/protoc 2024-04-03T11:41:22.3946505Z inflating: build/bin/c10_CompileTimeFunctionPointer_test 2024-04-03T11:41:22.4002634Z inflating: build/bin/c10_DeviceGuard_test 2024-04-03T11:41:22.4065800Z inflating: build/bin/c10_DispatchKeySet_test 2024-04-03T11:41:22.4120657Z inflating: build/bin/c10_Device_test 2024-04-03T11:41:22.4173436Z inflating: build/bin/c10_StreamGuard_test 2024-04-03T11:41:22.4229659Z inflating: build/bin/c10_SymInt_test 2024-04-03T11:41:22.4287322Z inflating: build/bin/c10_Scalar_test 2024-04-03T11:41:22.4346313Z inflating: build/bin/c10_InlineDeviceGuard_test 2024-04-03T11:41:22.4407687Z inflating: build/bin/c10_SizesAndStrides_test 2024-04-03T11:41:22.4468241Z inflating: build/bin/c10_InlineStreamGuard_test 2024-04-03T11:41:22.4544209Z inflating: build/bin/c10_cow_test 2024-04-03T11:41:22.4602531Z inflating: build/bin/c10_Bitset_test 2024-04-03T11:41:22.4655006Z inflating: build/bin/c10_ConstexprCrc_test 2024-04-03T11:41:22.4716586Z inflating: build/bin/c10_LeftRight_test 2024-04-03T11:41:22.4771162Z inflating: build/bin/c10_DeadlockDetection_test 2024-04-03T11:41:22.4826651Z inflating: build/bin/c10_Half_test 2024-04-03T11:41:22.4886369Z inflating: build/bin/c10_Metaprogramming_test 2024-04-03T11:41:22.4943219Z inflating: build/bin/c10_Synchronized_test 2024-04-03T11:41:22.5002699Z inflating: build/bin/c10_ThreadLocal_test 2024-04-03T11:41:22.5057954Z inflating: build/bin/c10_TypeIndex_test 
2024-04-03T11:41:22.5113753Z inflating: build/bin/c10_TypeList_test 2024-04-03T11:41:22.5167442Z inflating: build/bin/c10_TypeTraits_test 2024-04-03T11:41:22.5224207Z inflating: build/bin/c10_accumulate_test 2024-04-03T11:41:22.5283404Z inflating: build/bin/c10_bfloat16_test 2024-04-03T11:41:22.5343929Z inflating: build/bin/c10_complex_math_test 2024-04-03T11:41:22.5403468Z inflating: build/bin/c10_complex_test 2024-04-03T11:41:22.5457683Z inflating: build/bin/c10_bit_cast_test 2024-04-03T11:41:22.5515381Z inflating: build/bin/c10_exception_test 2024-04-03T11:41:22.5569219Z inflating: build/bin/c10_flags_test 2024-04-03T11:41:22.5624043Z inflating: build/bin/c10_generic_math_test 2024-04-03T11:41:22.5678509Z inflating: build/bin/c10_irange_test 2024-04-03T11:41:22.5739268Z inflating: build/bin/c10_logging_test 2024-04-03T11:41:22.5914695Z inflating: build/bin/c10_intrusive_ptr_test 2024-04-03T11:41:22.5994603Z inflating: build/bin/c10_optional_test 2024-04-03T11:41:22.6052473Z inflating: build/bin/c10_registry_test 2024-04-03T11:41:22.6119986Z inflating: build/bin/c10_ordered_preserving_dict_test 2024-04-03T11:41:22.6281209Z inflating: build/bin/c10_small_vector_test 2024-04-03T11:41:22.6337137Z inflating: build/bin/c10_ssize_test 2024-04-03T11:41:22.6393292Z inflating: build/bin/c10_string_util_test 2024-04-03T11:41:22.6447745Z inflating: build/bin/c10_tempfile_test 2024-04-03T11:41:22.6512210Z inflating: build/bin/c10_string_view_test 2024-04-03T11:41:22.6572695Z inflating: build/bin/c10_typeid_test 2024-04-03T11:41:22.6625897Z inflating: build/bin/c10_intrusive_ptr_benchmark 2024-04-03T11:41:22.6678934Z inflating: build/bin/c10_hip_HIPAssertionsTest_catches_thread_and_block_and_device 2024-04-03T11:41:22.6732793Z inflating: build/bin/c10_hip_HIPAssertionsTest_1_var_test 2024-04-03T11:41:22.6785593Z inflating: build/bin/c10_hip_HIPAssertionsTest_catches_stream 2024-04-03T11:41:22.6838881Z inflating: build/bin/c10_hip_HIPAssertionsTest_from_2_processes 
2024-04-03T11:41:22.6893185Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_multiple_blocks 2024-04-03T11:41:22.6946466Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_blocks_and_threads 2024-04-03T11:41:22.6999038Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_same_block 2024-04-03T11:41:22.7051245Z inflating: build/bin/c10_hip_HIPTest 2024-04-03T11:41:22.7399350Z inflating: build/bin/vec_test_all_types_DEFAULT 2024-04-03T11:41:22.7761363Z inflating: build/bin/vec_test_all_types_AVX512 2024-04-03T11:41:22.8140243Z inflating: build/bin/vec_test_all_types_AVX2 2024-04-03T11:41:22.8199889Z inflating: build/bin/TCPStoreTest 2024-04-03T11:41:22.8256691Z inflating: build/bin/FileStoreTest 2024-04-03T11:41:22.8315184Z inflating: build/bin/test_edge_op_registration 2024-04-03T11:41:22.8373110Z inflating: build/bin/HashStoreTest 2024-04-03T11:41:22.8444449Z inflating: build/bin/ProcessGroupGlooTest 2024-04-03T11:41:22.8446227Z inflating: build/bin/example_allreduce 2024-04-03T11:41:22.8451378Z inflating: build/bin/torch_shm_manager 2024-04-03T11:41:22.8519159Z inflating: build/bin/static_runtime_bench 2024-04-03T11:41:22.8787370Z inflating: build/bin/static_runtime_test 2024-04-03T11:41:22.8866445Z inflating: build/bin/Dict_test 2024-04-03T11:41:22.8923000Z inflating: build/bin/Dimname_test 2024-04-03T11:41:22.8993360Z inflating: build/bin/MaybeOwned_test 2024-04-03T11:41:22.9056454Z inflating: build/bin/NamedTensor_test 2024-04-03T11:41:22.9121851Z inflating: build/bin/apply_utils_test 2024-04-03T11:41:22.9185267Z inflating: build/bin/atest 2024-04-03T11:41:22.9253479Z inflating: build/bin/basic 2024-04-03T11:41:22.9314237Z inflating: build/bin/broadcast_test 2024-04-03T11:41:22.9370212Z inflating: build/bin/cpu_allocator_test 2024-04-03T11:41:22.9433112Z inflating: build/bin/cpu_generator_test 2024-04-03T11:41:22.9491360Z inflating: build/bin/cpu_profiling_allocator_test 2024-04-03T11:41:22.9590563Z 
inflating: build/bin/cpu_rng_test 2024-04-03T11:41:22.9644645Z inflating: build/bin/dispatch_key_set_test 2024-04-03T11:41:22.9700374Z inflating: build/bin/dlconvertor_test 2024-04-03T11:41:22.9762364Z inflating: build/bin/extension_backend_test 2024-04-03T11:41:22.9821837Z inflating: build/bin/half_test 2024-04-03T11:41:22.9923701Z inflating: build/bin/ivalue_test 2024-04-03T11:41:22.9976857Z inflating: build/bin/lazy_tensor_test 2024-04-03T11:41:23.0036696Z inflating: build/bin/math_kernel_test 2024-04-03T11:41:23.0095662Z inflating: build/bin/memory_format_test 2024-04-03T11:41:23.0151479Z inflating: build/bin/operator_name_test 2024-04-03T11:41:23.0211484Z inflating: build/bin/native_test 2024-04-03T11:41:23.0267292Z inflating: build/bin/operators_test 2024-04-03T11:41:23.0323784Z inflating: build/bin/packedtensoraccessor_test 2024-04-03T11:41:23.0395657Z inflating: build/bin/pow_test 2024-04-03T11:41:23.0450411Z inflating: build/bin/reduce_ops_test 2024-04-03T11:41:23.0512617Z inflating: build/bin/quantized_test 2024-04-03T11:41:23.0568865Z inflating: build/bin/reportMemoryUsage_test 2024-04-03T11:41:23.0630436Z inflating: build/bin/scalar_tensor_test 2024-04-03T11:41:23.0693386Z inflating: build/bin/scalar_test 2024-04-03T11:41:23.0749445Z inflating: build/bin/StorageUtils_test 2024-04-03T11:41:23.0806480Z inflating: build/bin/stride_properties_test 2024-04-03T11:41:23.0891984Z inflating: build/bin/tensor_iterator_test 2024-04-03T11:41:23.0895162Z inflating: build/bin/thread_init_test 2024-04-03T11:41:23.0953359Z inflating: build/bin/test_parallel 2024-04-03T11:41:23.1013496Z inflating: build/bin/type_ptr_test 2024-04-03T11:41:23.1078128Z inflating: build/bin/type_test 2024-04-03T11:41:23.1135380Z inflating: build/bin/undefined_tensor_test 2024-04-03T11:41:23.1137118Z inflating: build/bin/verify_api_visibility 2024-04-03T11:41:23.1212747Z inflating: build/bin/legacy_vmap_test 2024-04-03T11:41:23.1268612Z inflating: build/bin/weakref_test 
2024-04-03T11:41:23.1324893Z inflating: build/bin/wrapdim_test 2024-04-03T11:41:23.1389343Z inflating: build/bin/IListRef_test 2024-04-03T11:41:23.1502882Z inflating: build/bin/List_test 2024-04-03T11:41:23.1558986Z inflating: build/bin/xla_tensor_test 2024-04-03T11:41:23.1689577Z inflating: build/bin/kernel_function_legacy_test 2024-04-03T11:41:23.1761364Z inflating: build/bin/KernelFunction_test 2024-04-03T11:41:23.1864501Z inflating: build/bin/kernel_function_test 2024-04-03T11:41:23.2002062Z inflating: build/bin/kernel_lambda_legacy_test 2024-04-03T11:41:23.2112571Z inflating: build/bin/kernel_lambda_test 2024-04-03T11:41:23.2178043Z inflating: build/bin/kernel_stackbased_test 2024-04-03T11:41:23.2232896Z inflating: build/bin/CppSignature_test 2024-04-03T11:41:23.2335807Z inflating: build/bin/make_boxed_from_unboxed_functor_test 2024-04-03T11:41:23.2388439Z inflating: build/bin/op_allowlist_test 2024-04-03T11:41:23.2448359Z inflating: build/bin/backend_fallback_test 2024-04-03T11:41:23.2765093Z inflating: build/bin/op_registration_test 2024-04-03T11:41:23.2833621Z inflating: build/bin/inline_container_test 2024-04-03T11:41:23.2891406Z inflating: build/bin/hip_apply_test 2024-04-03T11:41:23.2945176Z inflating: build/bin/hip_complex_math_test 2024-04-03T11:41:23.2998438Z inflating: build/bin/hip_complex_test 2024-04-03T11:41:23.3050963Z inflating: build/bin/hip_distributions_test 2024-04-03T11:41:23.3105585Z inflating: build/bin/hip_generator_test 2024-04-03T11:41:23.3158602Z inflating: build/bin/hip_half_test 2024-04-03T11:41:23.3211423Z inflating: build/bin/hip_integer_divider_test 2024-04-03T11:41:23.3267252Z inflating: build/bin/hip_dlconvertor_test 2024-04-03T11:41:23.3321361Z inflating: build/bin/hip_optional_test 2024-04-03T11:41:23.3375215Z inflating: build/bin/hip_packedtensoraccessor_test 2024-04-03T11:41:23.3428544Z inflating: build/bin/hip_vectorized_test 2024-04-03T11:41:23.4022991Z inflating: build/bin/test_jit 2024-04-03T11:41:23.4864539Z 
inflating: build/bin/test_tensorexpr 2024-04-03T11:41:23.4882957Z inflating: build/bin/tutorial_tensorexpr 2024-04-03T11:41:23.4942010Z inflating: build/bin/test_dist_autograd 2024-04-03T11:41:23.5015760Z inflating: build/bin/test_cpp_rpc 2024-04-03T11:41:23.5017854Z inflating: build/bin/parallel_benchmark 2024-04-03T11:41:23.5090930Z inflating: build/bin/test_mobile_nnc 2024-04-03T11:41:23.5101788Z inflating: build/bin/aot_model_compiler_test 2024-04-03T11:41:23.5466725Z inflating: build/bin/test_lazy 2024-04-03T11:41:23.6710394Z inflating: build/bin/test_api 2024-04-03T11:41:23.6711833Z creating: .additional_ci_files/ 2024-04-03T11:41:23.6751300Z inflating: .additional_ci_files/test-times.json 2024-04-03T11:41:23.6905684Z inflating: .additional_ci_files/test-class-times.json 2024-04-03T11:41:23.6928345Z ##[debug]Finished: run 2024-04-03T11:41:23.6936171Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:23.6940310Z ##[debug]Evaluating: success() 2024-04-03T11:41:23.6942058Z ##[debug]Evaluating success: 2024-04-03T11:41:23.6943787Z ##[debug]=> true 2024-04-03T11:41:23.6945471Z ##[debug]Result: true 2024-04-03T11:41:23.6947343Z ##[debug]Starting: run 2024-04-03T11:41:23.6975252Z ##[debug]Loading inputs 2024-04-03T11:41:23.6978090Z ##[debug]Loading env 2024-04-03T11:41:23.6995256Z ##[group]Run rm artifacts.zip 2024-04-03T11:41:23.6996522Z rm artifacts.zip 2024-04-03T11:41:23.7052413Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:41:23.7053273Z env: 2024-04-03T11:41:23.7053620Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:23.7054126Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:23.7055054Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:23.7055949Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:23.7056421Z AWS_REGION: us-east-1 2024-04-03T11:41:23.7056950Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:23.7057554Z AWS_SECRET_ACCESS_KEY: *** 
2024-04-03T11:41:23.7065562Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:23.7065985Z ##[endgroup] 2024-04-03T11:41:23.7097832Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/96e67101-9c88-4a42-a65d-e1aebf59c562.sh 2024-04-03T11:41:23.8367059Z ##[debug]Finished: run 2024-04-03T11:41:23.8373827Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:23.8377090Z ##[debug]Evaluating: success() 2024-04-03T11:41:23.8378748Z ##[debug]Evaluating success: 2024-04-03T11:41:23.8380533Z ##[debug]=> true 2024-04-03T11:41:23.8382105Z ##[debug]Result: true 2024-04-03T11:41:23.8383864Z ##[debug]Starting: run 2024-04-03T11:41:23.8410095Z ##[debug]Loading inputs 2024-04-03T11:41:23.8412871Z ##[debug]Loading env 2024-04-03T11:41:23.8429231Z ##[group]Run df -H 2024-04-03T11:41:23.8430203Z df -H 2024-04-03T11:41:23.8484111Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:41:23.8484714Z env: 2024-04-03T11:41:23.8485050Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:23.8485550Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:23.8486469Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:23.8487346Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:23.8487806Z AWS_REGION: us-east-1 2024-04-03T11:41:23.8488302Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:23.8489073Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:23.8497061Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:23.8497471Z ##[endgroup] 2024-04-03T11:41:23.8524531Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/77f1c341-3852-4dcc-b32c-63103a34bc77.sh 2024-04-03T11:41:23.8575093Z Filesystem Size Used Avail Use% Mounted on 2024-04-03T11:41:23.8576538Z tmpfs 55G 2.8M 55G 1% /run 2024-04-03T11:41:23.8577960Z /dev/nvme0n1p1 984G 567G 367G 61% / 2024-04-03T11:41:23.8579397Z tmpfs 271G 17k 271G 1% /dev/shm 2024-04-03T11:41:23.8581069Z tmpfs 5.3M 
0 5.3M 0% /run/lock 2024-04-03T11:41:23.8582703Z tmpfs 55G 13k 55G 1% /run/user/1123 2024-04-03T11:41:23.8584380Z tmpfs 55G 0 55G 0% /run/user/1126 2024-04-03T11:41:23.8593495Z ##[debug]Finished: run 2024-04-03T11:41:23.8604188Z ##[debug]Finishing: Download build artifacts 2024-04-03T11:41:23.8630787Z ##[debug]Evaluating condition for step: 'Download TD artifacts' 2024-04-03T11:41:23.8634480Z ##[debug]Evaluating: success() 2024-04-03T11:41:23.8636189Z ##[debug]Evaluating success: 2024-04-03T11:41:23.8637865Z ##[debug]=> true 2024-04-03T11:41:23.8639453Z ##[debug]Result: true 2024-04-03T11:41:23.8641409Z ##[debug]Starting: Download TD artifacts 2024-04-03T11:41:23.8688272Z Prepare all required actions 2024-04-03T11:41:23.8688761Z Getting action download info 2024-04-03T11:41:24.0384525Z ##[debug]Action 'seemethere/download-artifact-s3@v4' already downloaded at '/home/pytorchci/actions-runner/_work/_actions/seemethere/download-artifact-s3/v4'. 2024-04-03T11:41:24.0394808Z ##[debug]Action 'actions/download-artifact@v3' already downloaded at '/home/pytorchci/actions-runner/_work/_actions/actions/download-artifact/v3'. 2024-04-03T11:41:24.0403658Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/seemethere/download-artifact-s3/v4/action.yml'. 2024-04-03T11:41:24.0473164Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/actions/download-artifact/v3/action.yml'. 
2024-04-03T11:41:24.0531052Z ##[debug]Loading inputs 2024-04-03T11:41:24.0532394Z ##[debug]Loading env 2024-04-03T11:41:24.0538673Z ##[group]Run ./.github/actions/download-td-artifacts 2024-04-03T11:41:24.0539223Z with: 2024-04-03T11:41:24.0539529Z env: 2024-04-03T11:41:24.0539921Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:24.0540420Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:24.0541345Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:24.0542232Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:24.0542699Z AWS_REGION: us-east-1 2024-04-03T11:41:24.0543198Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:24.0543797Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:24.0551759Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:24.0552170Z ##[endgroup] 2024-04-03T11:41:24.0555466Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:24.0556964Z ##[debug]Evaluating: (success() && !inputs.use-gha) 2024-04-03T11:41:24.0557734Z ##[debug]Evaluating And: 2024-04-03T11:41:24.0558325Z ##[debug]..Evaluating success: 2024-04-03T11:41:24.0558954Z ##[debug]..=> true 2024-04-03T11:41:24.0559513Z ##[debug]..Evaluating Not: 2024-04-03T11:41:24.0560105Z ##[debug]....Evaluating Index: 2024-04-03T11:41:24.0560723Z ##[debug]......Evaluating inputs: 2024-04-03T11:41:24.0561337Z ##[debug]......=> Object 2024-04-03T11:41:24.0561940Z ##[debug]......Evaluating String: 2024-04-03T11:41:24.0562550Z ##[debug]......=> 'use-gha' 2024-04-03T11:41:24.0563123Z ##[debug]....=> '' 2024-04-03T11:41:24.0563629Z ##[debug]..=> true 2024-04-03T11:41:24.0564126Z ##[debug]=> true 2024-04-03T11:41:24.0564785Z ##[debug]Expanded: (true && !'') 2024-04-03T11:41:24.0565428Z ##[debug]Result: true 2024-04-03T11:41:24.0566076Z ##[debug]Starting: run 2024-04-03T11:41:24.0589179Z ##[debug]Loading inputs 2024-04-03T11:41:24.0592356Z ##[debug]Loading env 2024-04-03T11:41:24.0598597Z ##[group]Run seemethere/download-artifact-s3@v4 
2024-04-03T11:41:24.0599142Z with: 2024-04-03T11:41:24.0599472Z name: td_results 2024-04-03T11:41:24.0599854Z s3-bucket: gha-artifacts 2024-04-03T11:41:24.0600279Z region: us-east-1 2024-04-03T11:41:24.0600633Z env: 2024-04-03T11:41:24.0600959Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:24.0601450Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:24.0602369Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:24.0603255Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:24.0603722Z AWS_REGION: us-east-1 2024-04-03T11:41:24.0604198Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:24.0604788Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:24.0612737Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:24.0613148Z ##[endgroup] 2024-04-03T11:41:24.5589815Z ##[debug]Resolved path is /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:41:24.5663086Z ##[debug]{"Bucket":"gha-artifacts","Prefix":"pytorch/pytorch/8536952077/td_results/"} 2024-04-03T11:41:24.5897783Z (node:3036368) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2024-04-03T11:41:24.5900415Z 2024-04-03T11:41:24.5901266Z Please migrate your code to use AWS SDK for JavaScript (v3). 
2024-04-03T11:41:24.5903675Z For more information, check the migration guide at https://a.co/7PzMCcy 2024-04-03T11:41:24.5906568Z (Use `node --trace-warnings ...` to show where the warning was created) 2024-04-03T11:41:24.9573469Z Found 1 objects with prefix pytorch/pytorch/8536952077/td_results/ 2024-04-03T11:41:24.9576879Z Starting download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/td_results.json 2024-04-03T11:41:24.9581454Z ##[debug]S3 download uri: s3://gha-artifacts/pytorch/pytorch/8536952077/td_results/td_results.json 2024-04-03T11:41:25.4216853Z Finished download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/td_results.json 2024-04-03T11:41:25.4230498Z Artifact download has finished successfully 2024-04-03T11:41:25.4463330Z ##[debug]Node Action run completed with exit code 0 2024-04-03T11:41:25.4471015Z ##[debug]Set output download-path = /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:41:25.4473758Z ##[debug]Finished: run 2024-04-03T11:41:25.4481869Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:25.4485181Z ##[debug]Evaluating: (success() && inputs.use-gha) 2024-04-03T11:41:25.4486837Z ##[debug]Evaluating And: 2024-04-03T11:41:25.4488147Z ##[debug]..Evaluating success: 2024-04-03T11:41:25.4489525Z ##[debug]..=> true 2024-04-03T11:41:25.4490747Z ##[debug]..Evaluating Index: 2024-04-03T11:41:25.4492098Z ##[debug]....Evaluating inputs: 2024-04-03T11:41:25.4493425Z ##[debug]....=> Object 2024-04-03T11:41:25.4494692Z ##[debug]....Evaluating String: 2024-04-03T11:41:25.4496025Z ##[debug]....=> 'use-gha' 2024-04-03T11:41:25.4497256Z ##[debug]..=> '' 2024-04-03T11:41:25.4498388Z ##[debug]=> '' 2024-04-03T11:41:25.4500021Z ##[debug]Expanded: (true && '') 2024-04-03T11:41:25.4501693Z ##[debug]Result: '' 2024-04-03T11:41:25.4505545Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:25.4508461Z ##[debug]Evaluating: success() 2024-04-03T11:41:25.4510085Z ##[debug]Evaluating success: 
2024-04-03T11:41:25.4511659Z ##[debug]=> true 2024-04-03T11:41:25.4513171Z ##[debug]Result: true 2024-04-03T11:41:25.4514967Z ##[debug]Starting: run 2024-04-03T11:41:25.4537430Z ##[debug]Loading inputs 2024-04-03T11:41:25.4540001Z ##[debug]Loading env 2024-04-03T11:41:25.4551927Z ##[group]Run mkdir -p .additional_ci_files 2024-04-03T11:41:25.4552599Z mkdir -p .additional_ci_files 2024-04-03T11:41:25.4553408Z mv td_results.json .additional_ci_files/td_results.json 2024-04-03T11:41:25.4587353Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:41:25.4588071Z env: 2024-04-03T11:41:25.4588463Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:25.4589066Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:25.4590163Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:25.4591225Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:25.4591778Z AWS_REGION: us-east-1 2024-04-03T11:41:25.4592374Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:25.4593082Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:25.4602112Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:25.4602536Z ##[endgroup] 2024-04-03T11:41:25.4630775Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/302d4705-8b21-4af8-a867-a2f5d15466fc.sh 2024-04-03T11:41:25.4711126Z ##[debug]Finished: run 2024-04-03T11:41:25.4721572Z ##[debug]Finishing: Download TD artifacts 2024-04-03T11:41:25.4747348Z ##[debug]Evaluating condition for step: 'Parse ref' 2024-04-03T11:41:25.4750849Z ##[debug]Evaluating: success() 2024-04-03T11:41:25.4752048Z ##[debug]Evaluating success: 2024-04-03T11:41:25.4753271Z ##[debug]=> true 2024-04-03T11:41:25.4754386Z ##[debug]Result: true 2024-04-03T11:41:25.4755722Z ##[debug]Starting: Parse ref 2024-04-03T11:41:25.4774558Z ##[debug]Loading inputs 2024-04-03T11:41:25.4776571Z ##[debug]Loading env 2024-04-03T11:41:25.4788746Z ##[group]Run .github/scripts/parse_ref.py 
2024-04-03T11:41:25.4789821Z .github/scripts/parse_ref.py 2024-04-03T11:41:25.4846530Z shell: /usr/bin/bash -e {0} 2024-04-03T11:41:25.4847418Z env: 2024-04-03T11:41:25.4848123Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:25.4849155Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:25.4851083Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:25.4853016Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:25.4853968Z AWS_REGION: us-east-1 2024-04-03T11:41:25.4854940Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:25.4856092Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:25.4870865Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:25.4871285Z ##[endgroup] 2024-04-03T11:41:25.4898200Z ##[debug]/usr/bin/bash -e /home/pytorchci/actions-runner/_work/_temp/15f03d8c-c725-4c7b-8af5-5e85f0742c87.sh 2024-04-03T11:41:25.5180709Z ##[debug]Set output tag = ciflow/trunk/123154 2024-04-03T11:41:25.5182962Z ##[debug]Finishing: Parse ref 2024-04-03T11:41:25.5204692Z ##[debug]Evaluating condition for step: 'Get workflow job id' 2024-04-03T11:41:25.5208156Z ##[debug]Evaluating: always() 2024-04-03T11:41:25.5209578Z ##[debug]Evaluating always: 2024-04-03T11:41:25.5210967Z ##[debug]=> true 2024-04-03T11:41:25.5212312Z ##[debug]Result: true 2024-04-03T11:41:25.5214049Z ##[debug]Starting: Get workflow job id 2024-04-03T11:41:25.5278431Z Prepare all required actions 2024-04-03T11:41:25.5320157Z ##[debug]Loading inputs 2024-04-03T11:41:25.5321386Z ##[debug]Evaluating: secrets.GITHUB_TOKEN 2024-04-03T11:41:25.5321942Z ##[debug]Evaluating Index: 2024-04-03T11:41:25.5322410Z ##[debug]..Evaluating secrets: 2024-04-03T11:41:25.5322887Z ##[debug]..=> Object 2024-04-03T11:41:25.5323328Z ##[debug]..Evaluating String: 2024-04-03T11:41:25.5323811Z ##[debug]..=> 'GITHUB_TOKEN' 2024-04-03T11:41:25.5324637Z ##[debug]=> '***' 2024-04-03T11:41:25.5325244Z ##[debug]Result: '***' 2024-04-03T11:41:25.5325767Z ##[debug]Loading env 
2024-04-03T11:41:25.5332216Z ##[group]Run ./.github/actions/get-workflow-job-id 2024-04-03T11:41:25.5332772Z with: 2024-04-03T11:41:25.5333288Z github-token: *** 2024-04-03T11:41:25.5333668Z env: 2024-04-03T11:41:25.5334012Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:25.5334527Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:25.5335468Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:25.5336369Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:25.5336851Z AWS_REGION: us-east-1 2024-04-03T11:41:25.5337338Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:25.5337962Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:25.5347856Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:25.5348382Z ##[endgroup] 2024-04-03T11:41:25.5351336Z ##[debug]Evaluating: inputs.github-token 2024-04-03T11:41:25.5351881Z ##[debug]Evaluating Index: 2024-04-03T11:41:25.5352360Z ##[debug]..Evaluating inputs: 2024-04-03T11:41:25.5352824Z ##[debug]..=> Object 2024-04-03T11:41:25.5353252Z ##[debug]..Evaluating String: 2024-04-03T11:41:25.5353721Z ##[debug]..=> 'github-token' 2024-04-03T11:41:25.5354335Z ##[debug]=> '***' 2024-04-03T11:41:25.5354903Z ##[debug]Result: '***' 2024-04-03T11:41:25.5355462Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:25.5356597Z ##[debug]Evaluating: success() 2024-04-03T11:41:25.5357221Z ##[debug]Evaluating success: 2024-04-03T11:41:25.5357872Z ##[debug]=> true 2024-04-03T11:41:25.5358458Z ##[debug]Result: true 2024-04-03T11:41:25.5359129Z ##[debug]Starting: run 2024-04-03T11:41:25.5369502Z ##[debug]Loading inputs 2024-04-03T11:41:25.5370554Z ##[debug]Loading env 2024-04-03T11:41:25.5376849Z ##[group]Run set -eux 2024-04-03T11:41:25.5377263Z set -eux 2024-04-03T11:41:25.5377970Z python3 .github/scripts/get_workflow_job_id.py "${GITHUB_RUN_ID}" "${RUNNER_NAME}" 2024-04-03T11:41:25.5413031Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:41:25.5413647Z env: 
2024-04-03T11:41:25.5413998Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:25.5414514Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:25.5415441Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:25.5416320Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:25.5416792Z AWS_REGION: us-east-1 2024-04-03T11:41:25.5417302Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:25.5417899Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:25.5426972Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:25.5427689Z GITHUB_TOKEN: *** 2024-04-03T11:41:25.5428159Z ##[endgroup] 2024-04-03T11:41:25.5458643Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/3f24a93b-1d23-42a5-a678-3f0559c0c786.sh 2024-04-03T11:41:25.5492065Z + python3 .github/scripts/get_workflow_job_id.py 8536952077 worker-rocm-amd-28 2024-04-03T11:41:27.8734499Z setting job-id=23389745826 2024-04-03T11:41:27.8736702Z setting job-name=linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:41:27.8898646Z ##[debug]Set output job-id = 23389745826 2024-04-03T11:41:27.8901084Z ##[debug]Set output job-name = linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:41:27.8904036Z ##[debug]Finished: run 2024-04-03T11:41:27.8912729Z ##[debug]Evaluating: steps.get-job-id.outputs.job-id 2024-04-03T11:41:27.8914502Z ##[debug]Evaluating Index: 2024-04-03T11:41:27.8915747Z ##[debug]..Evaluating Index: 2024-04-03T11:41:27.8916872Z ##[debug]....Evaluating Index: 2024-04-03T11:41:27.8918032Z ##[debug]......Evaluating steps: 2024-04-03T11:41:27.8919222Z ##[debug]......=> Object 2024-04-03T11:41:27.8920328Z ##[debug]......Evaluating String: 2024-04-03T11:41:27.8921520Z ##[debug]......=> 'get-job-id' 2024-04-03T11:41:27.8922655Z ##[debug]....=> Object 2024-04-03T11:41:27.8923709Z ##[debug]....Evaluating String: 2024-04-03T11:41:27.8924896Z ##[debug]....=> 'outputs' 
2024-04-03T11:41:27.8925943Z ##[debug]..=> Object 2024-04-03T11:41:27.8926947Z ##[debug]..Evaluating String: 2024-04-03T11:41:27.8928066Z ##[debug]..=> 'job-id' 2024-04-03T11:41:27.8929070Z ##[debug]=> '23389745826' 2024-04-03T11:41:27.8930151Z ##[debug]Result: '23389745826' 2024-04-03T11:41:27.8932183Z ##[debug]Evaluating: steps.get-job-id.outputs.job-name 2024-04-03T11:41:27.8933720Z ##[debug]Evaluating Index: 2024-04-03T11:41:27.8934817Z ##[debug]..Evaluating Index: 2024-04-03T11:41:27.8935947Z ##[debug]....Evaluating Index: 2024-04-03T11:41:27.8937100Z ##[debug]......Evaluating steps: 2024-04-03T11:41:27.8938262Z ##[debug]......=> Object 2024-04-03T11:41:27.8939348Z ##[debug]......Evaluating String: 2024-04-03T11:41:27.8940660Z ##[debug]......=> 'get-job-id' 2024-04-03T11:41:27.8941776Z ##[debug]....=> Object 2024-04-03T11:41:27.8942917Z ##[debug]....Evaluating String: 2024-04-03T11:41:27.8944277Z ##[debug]....=> 'outputs' 2024-04-03T11:41:27.8945516Z ##[debug]..=> Object 2024-04-03T11:41:27.8946731Z ##[debug]..Evaluating String: 2024-04-03T11:41:27.8948069Z ##[debug]..=> 'job-name' 2024-04-03T11:41:27.8950029Z ##[debug]=> 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 2024-04-03T11:41:27.8953045Z ##[debug]Result: 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 2024-04-03T11:41:27.8960432Z ##[debug]Finishing: Get workflow job id 2024-04-03T11:41:27.8985868Z ##[debug]Evaluating condition for step: 'Check for keep-going label and re-enabled test issues' 2024-04-03T11:41:27.8990138Z ##[debug]Evaluating: success() 2024-04-03T11:41:27.8991937Z ##[debug]Evaluating success: 2024-04-03T11:41:27.8993698Z ##[debug]=> true 2024-04-03T11:41:27.8995350Z ##[debug]Result: true 2024-04-03T11:41:27.8996772Z ##[debug]Starting: Check for keep-going label and re-enabled test issues 2024-04-03T11:41:27.9153673Z Prepare all required actions 2024-04-03T11:41:27.9154314Z Getting action download info 2024-04-03T11:41:28.0761481Z Download action 
repository 'nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482' (SHA:3e91a01664abd3c5cd539100d10d33b9c5b68482) 2024-04-03T11:41:29.1297981Z ##[debug]Download 'https://api.github.com/repos/nick-fields/retry/tarball/3e91a01664abd3c5cd539100d10d33b9c5b68482' to '/home/pytorchci/actions-runner/_work/_actions/_temp_1f60f44a-1572-42af-b615-5e33b73b7997/0af027a0-768a-4d1f-af14-bd49fd27cdb6.tar.gz' 2024-04-03T11:41:29.1549198Z ##[debug]Unwrap 'nick-fields-retry-3e91a01' to '/home/pytorchci/actions-runner/_work/_actions/nick-fields/retry/3e91a01664abd3c5cd539100d10d33b9c5b68482' 2024-04-03T11:41:29.1637126Z ##[debug]Archive '/home/pytorchci/actions-runner/_work/_actions/_temp_1f60f44a-1572-42af-b615-5e33b73b7997/0af027a0-768a-4d1f-af14-bd49fd27cdb6.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/nick-fields/retry/3e91a01664abd3c5cd539100d10d33b9c5b68482'. 2024-04-03T11:41:29.1684004Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/nick-fields/retry/3e91a01664abd3c5cd539100d10d33b9c5b68482/action.yml'. 
2024-04-03T11:41:29.1955264Z ##[debug]Loading inputs 2024-04-03T11:41:29.1956576Z ##[debug]Evaluating: secrets.GITHUB_TOKEN 2024-04-03T11:41:29.1957135Z ##[debug]Evaluating Index: 2024-04-03T11:41:29.1957608Z ##[debug]..Evaluating secrets: 2024-04-03T11:41:29.1958089Z ##[debug]..=> Object 2024-04-03T11:41:29.1958532Z ##[debug]..Evaluating String: 2024-04-03T11:41:29.1959028Z ##[debug]..=> 'GITHUB_TOKEN' 2024-04-03T11:41:29.1959704Z ##[debug]=> '***' 2024-04-03T11:41:29.1960307Z ##[debug]Result: '***' 2024-04-03T11:41:29.1960913Z ##[debug]Evaluating: inputs.test-matrix 2024-04-03T11:41:29.1961460Z ##[debug]Evaluating Index: 2024-04-03T11:41:29.1961928Z ##[debug]..Evaluating inputs: 2024-04-03T11:41:29.1962405Z ##[debug]..=> Object 2024-04-03T11:41:29.1962838Z ##[debug]..Evaluating String: 2024-04-03T11:41:29.1963317Z ##[debug]..=> 'test-matrix' 2024-04-03T11:41:29.1964111Z ##[debug]=> '{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}' 2024-04-03T11:41:29.1965338Z ##[debug]Result: '{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}' 2024-04-03T11:41:29.1966458Z ##[debug]Evaluating: steps.get-job-id.outputs.job-name 2024-04-03T11:41:29.1967086Z ##[debug]Evaluating Index: 2024-04-03T11:41:29.1967562Z ##[debug]..Evaluating Index: 2024-04-03T11:41:29.1968041Z ##[debug]....Evaluating Index: 2024-04-03T11:41:29.1968534Z ##[debug]......Evaluating steps: 2024-04-03T11:41:29.1969020Z ##[debug]......=> Object 2024-04-03T11:41:29.1969475Z ##[debug]......Evaluating String: 2024-04-03T11:41:29.1969976Z ##[debug]......=> 'get-job-id' 2024-04-03T11:41:29.1970456Z ##[debug]....=> Object 2024-04-03T11:41:29.1970888Z ##[debug]....Evaluating String: 2024-04-03T11:41:29.1971365Z ##[debug]....=> 'outputs' 2024-04-03T11:41:29.1971808Z ##[debug]..=> Object 2024-04-03T11:41:29.1972234Z ##[debug]..Evaluating String: 2024-04-03T11:41:29.1972699Z ##[debug]..=> 'job-name' 2024-04-03T11:41:29.1973397Z 
##[debug]=> 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 2024-04-03T11:41:29.1974439Z ##[debug]Result: 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 2024-04-03T11:41:29.1975288Z ##[debug]Loading env 2024-04-03T11:41:29.1982496Z ##[group]Run ./.github/actions/filter-test-configs 2024-04-03T11:41:29.1983155Z with: 2024-04-03T11:41:29.1983766Z github-token: *** 2024-04-03T11:41:29.1984900Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]} 2024-04-03T11:41:29.1986261Z job-name: linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:41:29.1987151Z env: 2024-04-03T11:41:29.1987564Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:29.1988177Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:29.1989310Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:29.1990394Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:29.1990985Z AWS_REGION: us-east-1 2024-04-03T11:41:29.1991474Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:29.1992086Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:29.2000206Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:29.2000630Z ##[endgroup] 2024-04-03T11:41:29.2005658Z ##[debug]Evaluating: inputs.github-token 2024-04-03T11:41:29.2006208Z ##[debug]Evaluating Index: 2024-04-03T11:41:29.2006674Z ##[debug]..Evaluating inputs: 2024-04-03T11:41:29.2007148Z ##[debug]..=> Object 2024-04-03T11:41:29.2007579Z ##[debug]..Evaluating String: 2024-04-03T11:41:29.2008056Z ##[debug]..=> 'github-token' 2024-04-03T11:41:29.2008773Z ##[debug]=> '***' 2024-04-03T11:41:29.2009349Z ##[debug]Result: '***' 2024-04-03T11:41:29.2009901Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:29.2011059Z ##[debug]Evaluating: success() 2024-04-03T11:41:29.2011696Z ##[debug]Evaluating success: 2024-04-03T11:41:29.2012327Z ##[debug]=> true 2024-04-03T11:41:29.2012926Z ##[debug]Result: true 
2024-04-03T11:41:29.2013596Z ##[debug]Starting: run 2024-04-03T11:41:29.2074415Z ##[debug]Loading inputs 2024-04-03T11:41:29.2080664Z ##[debug]Loading env 2024-04-03T11:41:29.2087150Z ##[group]Run nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482 2024-04-03T11:41:29.2087823Z with: 2024-04-03T11:41:29.2088152Z shell: bash 2024-04-03T11:41:29.2088521Z timeout_minutes: 10 2024-04-03T11:41:29.2088922Z max_attempts: 5 2024-04-03T11:41:29.2089319Z retry_wait_seconds: 30 2024-04-03T11:41:29.2090262Z command: set -eux # PyYAML 6.0 doesn't work with MacOS x86 anymore python3 -m pip install requests==2.26.0 pyyaml==6.0.1 2024-04-03T11:41:29.2091293Z polling_interval_seconds: 1 2024-04-03T11:41:29.2091752Z warning_on_retry: true 2024-04-03T11:41:29.2092183Z continue_on_error: false 2024-04-03T11:41:29.2092613Z env: 2024-04-03T11:41:29.2092952Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:29.2093455Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:29.2094377Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:29.2095274Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:29.2095746Z AWS_REGION: us-east-1 2024-04-03T11:41:29.2096233Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:29.2096840Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:29.2105764Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:29.2106482Z GITHUB_TOKEN: *** 2024-04-03T11:41:29.2106944Z ##[endgroup] 2024-04-03T11:41:29.2705547Z ##[debug]Running command set -eux 2024-04-03T11:41:29.2707336Z ##[debug]# PyYAML 6.0 doesn't work with MacOS x86 anymore 2024-04-03T11:41:29.2709947Z ##[debug]python3 -m pip install requests==2.26.0 pyyaml==6.0.1 on linux using shell bash 2024-04-03T11:41:29.2781388Z + python3 -m pip install requests==2.26.0 pyyaml==6.0.1 2024-04-03T11:41:29.6258146Z Defaulting to user installation because normal site-packages is not writeable 2024-04-03T11:41:29.7409643Z Requirement already satisfied: requests==2.26.0 in 
/home/pytorchci/.local/lib/python3.10/site-packages (2.26.0) 2024-04-03T11:41:29.7414373Z Requirement already satisfied: pyyaml==6.0.1 in /home/pytorchci/.local/lib/python3.10/site-packages (6.0.1) 2024-04-03T11:41:29.7557192Z Requirement already satisfied: idna<4,>=2.5 in /usr/lib/python3/dist-packages (from requests==2.26.0) (3.3) 2024-04-03T11:41:29.7573278Z Requirement already satisfied: charset-normalizer~=2.0.0 in /home/pytorchci/.local/lib/python3.10/site-packages (from requests==2.26.0) (2.0.12) 2024-04-03T11:41:29.7578494Z Requirement already satisfied: certifi>=2017.4.17 in /usr/lib/python3/dist-packages (from requests==2.26.0) (2020.6.20) 2024-04-03T11:41:29.7584594Z Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/lib/python3/dist-packages (from requests==2.26.0) (1.26.5) 2024-04-03T11:41:30.0164885Z ##[debug]Code: 0 2024-04-03T11:41:30.0166650Z ##[debug]Signal: null 2024-04-03T11:41:30.2779494Z Command completed after 1 attempt(s). 2024-04-03T11:41:30.2844900Z ##[debug]Node Action run completed with exit code 0 2024-04-03T11:41:30.2852067Z ##[debug]Set output total_attempts = 1 2024-04-03T11:41:30.2853532Z ##[debug]Set output exit_code = 0 2024-04-03T11:41:30.2855396Z ##[debug]Finished: run 2024-04-03T11:41:30.2866389Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:30.2869832Z ##[debug]Evaluating: success() 2024-04-03T11:41:30.2871429Z ##[debug]Evaluating success: 2024-04-03T11:41:30.2872657Z ##[debug]=> true 2024-04-03T11:41:30.2873778Z ##[debug]Result: true 2024-04-03T11:41:30.2875046Z ##[debug]Starting: run 2024-04-03T11:41:30.2893723Z ##[debug]Loading inputs 2024-04-03T11:41:30.2895704Z ##[debug]Loading env 2024-04-03T11:41:30.2907601Z ##[group]Run set -x 2024-04-03T11:41:30.2908328Z set -x 2024-04-03T11:41:30.2908994Z  2024-04-03T11:41:30.2910230Z # Use relative path here as this could be checked out anywhere, not necessarily 2024-04-03T11:41:30.2911784Z # in runner workspace 2024-04-03T11:41:30.2912974Z python3 
"${GITHUB_ACTION_PATH}/../../scripts/parse_ref.py" 2024-04-03T11:41:30.2967650Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:41:30.2968774Z env: 2024-04-03T11:41:30.2969398Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:30.2970323Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:30.2972033Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:30.2973691Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:30.2974555Z AWS_REGION: us-east-1 2024-04-03T11:41:30.2975476Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:30.2976617Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:30.2991897Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:30.2992417Z ##[endgroup] 2024-04-03T11:41:30.3019419Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/4147f997-e404-4fa6-8055-6413c720d713.sh 2024-04-03T11:41:30.3053946Z + python3 /home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/filter-test-configs/../../scripts/parse_ref.py 2024-04-03T11:41:30.3334942Z ##[debug]Set output tag = ciflow/trunk/123154 2024-04-03T11:41:30.3337053Z ##[debug]Finished: run 2024-04-03T11:41:30.3345405Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:30.3349803Z ##[debug]Evaluating: (success() && (inputs.job-name == '')) 2024-04-03T11:41:30.3351876Z ##[debug]Evaluating And: 2024-04-03T11:41:30.3353453Z ##[debug]..Evaluating success: 2024-04-03T11:41:30.3355393Z ##[debug]..=> true 2024-04-03T11:41:30.3357003Z ##[debug]..Evaluating Equal: 2024-04-03T11:41:30.3358753Z ##[debug]....Evaluating Index: 2024-04-03T11:41:30.3360393Z ##[debug]......Evaluating inputs: 2024-04-03T11:41:30.3362055Z ##[debug]......=> Object 2024-04-03T11:41:30.3363638Z ##[debug]......Evaluating String: 2024-04-03T11:41:30.3365291Z ##[debug]......=> 'job-name' 2024-04-03T11:41:30.3368148Z ##[debug]....=> 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 
2024-04-03T11:41:30.3370585Z ##[debug]....Evaluating String: 2024-04-03T11:41:30.3372159Z ##[debug]....=> '' 2024-04-03T11:41:30.3373605Z ##[debug]..=> false 2024-04-03T11:41:30.3375041Z ##[debug]=> false 2024-04-03T11:41:30.3378762Z ##[debug]Expanded: (true && ('linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' == '')) 2024-04-03T11:41:30.3382135Z ##[debug]Result: false 2024-04-03T11:41:30.3388989Z ##[debug]Evaluating: inputs.github-token 2024-04-03T11:41:30.3390457Z ##[debug]Evaluating Index: 2024-04-03T11:41:30.3392096Z ##[debug]..Evaluating inputs: 2024-04-03T11:41:30.3393045Z ##[debug]..=> Object 2024-04-03T11:41:30.3393871Z ##[debug]..Evaluating String: 2024-04-03T11:41:30.3394774Z ##[debug]..=> 'github-token' 2024-04-03T11:41:30.3396029Z ##[debug]=> '***' 2024-04-03T11:41:30.3397124Z ##[debug]Result: '***' 2024-04-03T11:41:30.3399228Z ##[debug]Evaluating: (((inputs.job-name == '') && steps.get-job-name.outputs.job-name) || inputs.job-name) 2024-04-03T11:41:30.3401050Z ##[debug]Evaluating Or: 2024-04-03T11:41:30.3401897Z ##[debug]..Evaluating And: 2024-04-03T11:41:30.3402782Z ##[debug]....Evaluating Equal: 2024-04-03T11:41:30.3403713Z ##[debug]......Evaluating Index: 2024-04-03T11:41:30.3404694Z ##[debug]........Evaluating inputs: 2024-04-03T11:41:30.3405670Z ##[debug]........=> Object 2024-04-03T11:41:30.3406565Z ##[debug]........Evaluating String: 2024-04-03T11:41:30.3407530Z ##[debug]........=> 'job-name' 2024-04-03T11:41:30.3408933Z ##[debug]......=> 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 2024-04-03T11:41:30.3410441Z ##[debug]......Evaluating String: 2024-04-03T11:41:30.3411377Z ##[debug]......=> '' 2024-04-03T11:41:30.3412146Z ##[debug]....=> false 2024-04-03T11:41:30.3412918Z ##[debug]..=> false 2024-04-03T11:41:30.3413705Z ##[debug]..Evaluating Index: 2024-04-03T11:41:30.3414605Z ##[debug]....Evaluating inputs: 2024-04-03T11:41:30.3415514Z ##[debug]....=> Object 2024-04-03T11:41:30.3416337Z 
##[debug]....Evaluating String: 2024-04-03T11:41:30.3417252Z ##[debug]....=> 'job-name' 2024-04-03T11:41:30.3418575Z ##[debug]..=> 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 2024-04-03T11:41:30.3421644Z ##[debug]=> 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 2024-04-03T11:41:30.3425314Z ##[debug]Expanded: ((('linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' == '') && steps['get-job-name']['outputs']['job-name']) || 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)') 2024-04-03T11:41:30.3428705Z ##[debug]Result: 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 2024-04-03T11:41:30.3430755Z ##[debug]Evaluating: github.event.pull_request.number 2024-04-03T11:41:30.3431942Z ##[debug]Evaluating Index: 2024-04-03T11:41:30.3432817Z ##[debug]..Evaluating Index: 2024-04-03T11:41:30.3433709Z ##[debug]....Evaluating Index: 2024-04-03T11:41:30.3434650Z ##[debug]......Evaluating github: 2024-04-03T11:41:30.3435605Z ##[debug]......=> Object 2024-04-03T11:41:30.3436469Z ##[debug]......Evaluating String: 2024-04-03T11:41:30.3437391Z ##[debug]......=> 'event' 2024-04-03T11:41:30.3438247Z ##[debug]....=> Object 2024-04-03T11:41:30.3439090Z ##[debug]....Evaluating String: 2024-04-03T11:41:30.3440025Z ##[debug]....=> 'pull_request' 2024-04-03T11:41:30.3440915Z ##[debug]..=> null 2024-04-03T11:41:30.3441677Z ##[debug]=> null 2024-04-03T11:41:30.3442456Z ##[debug]Result: null 2024-04-03T11:41:30.3445279Z ##[debug]Evaluating: steps.parse-ref.outputs.tag 2024-04-03T11:41:30.3446422Z ##[debug]Evaluating Index: 2024-04-03T11:41:30.3447314Z ##[debug]..Evaluating Index: 2024-04-03T11:41:30.3448223Z ##[debug]....Evaluating Index: 2024-04-03T11:41:30.3449145Z ##[debug]......Evaluating steps: 2024-04-03T11:41:30.3450077Z ##[debug]......=> Object 2024-04-03T11:41:30.3450952Z ##[debug]......Evaluating String: 2024-04-03T11:41:30.3451912Z ##[debug]......=> 'parse-ref' 2024-04-03T11:41:30.3452809Z 
##[debug]....=> Object 2024-04-03T11:41:30.3453640Z ##[debug]....Evaluating String: 2024-04-03T11:41:30.3454563Z ##[debug]....=> 'outputs' 2024-04-03T11:41:30.3455408Z ##[debug]..=> Object 2024-04-03T11:41:30.3456216Z ##[debug]..Evaluating String: 2024-04-03T11:41:30.3457107Z ##[debug]..=> 'tag' 2024-04-03T11:41:30.3457955Z ##[debug]=> 'ciflow/trunk/123154' 2024-04-03T11:41:30.3458961Z ##[debug]Result: 'ciflow/trunk/123154' 2024-04-03T11:41:30.3460398Z ##[debug]Evaluating: github.event_name 2024-04-03T11:41:30.3461425Z ##[debug]Evaluating Index: 2024-04-03T11:41:30.3462319Z ##[debug]..Evaluating github: 2024-04-03T11:41:30.3463544Z ##[debug]..=> Object 2024-04-03T11:41:30.3464439Z ##[debug]..Evaluating String: 2024-04-03T11:41:30.3465307Z ##[debug]..=> 'event_name' 2024-04-03T11:41:30.3466125Z ##[debug]=> 'push' 2024-04-03T11:41:30.3466865Z ##[debug]Result: 'push' 2024-04-03T11:41:30.3468026Z ##[debug]Evaluating: github.event.schedule 2024-04-03T11:41:30.3469038Z ##[debug]Evaluating Index: 2024-04-03T11:41:30.3469877Z ##[debug]..Evaluating Index: 2024-04-03T11:41:30.3470733Z ##[debug]....Evaluating github: 2024-04-03T11:41:30.3471606Z ##[debug]....=> Object 2024-04-03T11:41:30.3472401Z ##[debug]....Evaluating String: 2024-04-03T11:41:30.3473278Z ##[debug]....=> 'event' 2024-04-03T11:41:30.3474050Z ##[debug]..=> Object 2024-04-03T11:41:30.3474838Z ##[debug]..Evaluating String: 2024-04-03T11:41:30.3475685Z ##[debug]..=> 'schedule' 2024-04-03T11:41:30.3476470Z ##[debug]=> null 2024-04-03T11:41:30.3477195Z ##[debug]Result: null 2024-04-03T11:41:30.3478515Z ##[debug]Evaluating: github.event.workflow_run.head_branch 2024-04-03T11:41:30.3479700Z ##[debug]Evaluating Index: 2024-04-03T11:41:30.3480539Z ##[debug]..Evaluating Index: 2024-04-03T11:41:30.3481390Z ##[debug]....Evaluating Index: 2024-04-03T11:41:30.3482263Z ##[debug]......Evaluating github: 2024-04-03T11:41:30.3483165Z ##[debug]......=> Object 2024-04-03T11:41:30.3483995Z ##[debug]......Evaluating String: 
2024-04-03T11:41:30.3484884Z ##[debug]......=> 'event' 2024-04-03T11:41:30.3485681Z ##[debug]....=> Object 2024-04-03T11:41:30.3486476Z ##[debug]....Evaluating String: 2024-04-03T11:41:30.3487384Z ##[debug]....=> 'workflow_run' 2024-04-03T11:41:30.3488222Z ##[debug]..=> null 2024-04-03T11:41:30.3489187Z ##[debug]=> null 2024-04-03T11:41:30.3489906Z ##[debug]Result: null 2024-04-03T11:41:30.3490962Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:30.3493156Z ##[debug]Evaluating: success() 2024-04-03T11:41:30.3494327Z ##[debug]Evaluating success: 2024-04-03T11:41:30.3495478Z ##[debug]=> true 2024-04-03T11:41:30.3496580Z ##[debug]Result: true 2024-04-03T11:41:30.3497807Z ##[debug]Starting: run 2024-04-03T11:41:30.3515856Z ##[debug]Loading inputs 2024-04-03T11:41:30.3517696Z ##[debug]Evaluating: format('echo "Workflow: ${{GITHUB_WORKFLOW}}" 2024-04-03T11:41:30.3518386Z ##[debug]echo "Job name: ${{JOB_NAME}}" 2024-04-03T11:41:30.3518875Z ##[debug] 2024-04-03T11:41:30.3519535Z ##[debug]# Use relative path here as this could be checked out anywhere, not necessarily 2024-04-03T11:41:30.3520344Z ##[debug]# in runner workspace 2024-04-03T11:41:30.3521023Z ##[debug]python3 "${{GITHUB_ACTION_PATH}}/../../scripts/filter_test_configs.py" \ 2024-04-03T11:41:30.3521801Z ##[debug] --workflow "${{GITHUB_WORKFLOW}}" \ 2024-04-03T11:41:30.3522379Z ##[debug] --job-name "${{JOB_NAME}}" \ 2024-04-03T11:41:30.3522900Z ##[debug] --test-matrix "{0}" \ 2024-04-03T11:41:30.3523402Z ##[debug] --pr-number "${{PR_NUMBER}}" \ 2024-04-03T11:41:30.3523923Z ##[debug] --tag "${{TAG}}" \ 2024-04-03T11:41:30.3524409Z ##[debug] --event-name "${{EVENT_NAME}}" \ 2024-04-03T11:41:30.3524963Z ##[debug] --schedule "${{SCHEDULE}}" \ 2024-04-03T11:41:30.3525490Z ##[debug] --branch "${{HEAD_BRANCH}}" 2024-04-03T11:41:30.3526004Z ##[debug]', inputs.test-matrix) 2024-04-03T11:41:30.3526495Z ##[debug]Evaluating format: 2024-04-03T11:41:30.3526957Z ##[debug]..Evaluating String: 
2024-04-03T11:41:30.3527532Z ##[debug]..=> 'echo "Workflow: ${{GITHUB_WORKFLOW}}" 2024-04-03T11:41:30.3528127Z ##[debug]echo "Job name: ${{JOB_NAME}}" 2024-04-03T11:41:30.3528618Z ##[debug] 2024-04-03T11:41:30.3529260Z ##[debug]# Use relative path here as this could be checked out anywhere, not necessarily 2024-04-03T11:41:30.3530070Z ##[debug]# in runner workspace 2024-04-03T11:41:30.3530752Z ##[debug]python3 "${{GITHUB_ACTION_PATH}}/../../scripts/filter_test_configs.py" \ 2024-04-03T11:41:30.3531524Z ##[debug] --workflow "${{GITHUB_WORKFLOW}}" \ 2024-04-03T11:41:30.3532085Z ##[debug] --job-name "${{JOB_NAME}}" \ 2024-04-03T11:41:30.3532607Z ##[debug] --test-matrix "{0}" \ 2024-04-03T11:41:30.3533110Z ##[debug] --pr-number "${{PR_NUMBER}}" \ 2024-04-03T11:41:30.3533796Z ##[debug] --tag "${{TAG}}" \ 2024-04-03T11:41:30.3534294Z ##[debug] --event-name "${{EVENT_NAME}}" \ 2024-04-03T11:41:30.3534842Z ##[debug] --schedule "${{SCHEDULE}}" \ 2024-04-03T11:41:30.3535367Z ##[debug] --branch "${{HEAD_BRANCH}}" 2024-04-03T11:41:30.3535859Z ##[debug]' 2024-04-03T11:41:30.3536639Z ##[debug]..Evaluating Index: 2024-04-03T11:41:30.3537111Z ##[debug]....Evaluating inputs: 2024-04-03T11:41:30.3537592Z ##[debug]....=> Object 2024-04-03T11:41:30.3538026Z ##[debug]....Evaluating String: 2024-04-03T11:41:30.3538508Z ##[debug]....=> 'test-matrix' 2024-04-03T11:41:30.3539320Z ##[debug]..=> '{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}' 2024-04-03T11:41:30.3540458Z ##[debug]=> 'echo "Workflow: ${GITHUB_WORKFLOW}" 2024-04-03T11:41:30.3541143Z ##[debug]echo "Job name: ${JOB_NAME}" 2024-04-03T11:41:30.3541727Z ##[debug] 2024-04-03T11:41:30.3542508Z ##[debug]# Use relative path here as this could be checked out anywhere, not necessarily 2024-04-03T11:41:30.3543499Z ##[debug]# in runner workspace 2024-04-03T11:41:30.3544307Z ##[debug]python3 "${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2024-04-03T11:41:30.3545231Z ##[debug] 
--workflow "${GITHUB_WORKFLOW}" \ 2024-04-03T11:41:30.3545904Z ##[debug] --job-name "${JOB_NAME}" \ 2024-04-03T11:41:30.3546991Z ##[debug] --test-matrix "{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}" \ 2024-04-03T11:41:30.3548128Z ##[debug] --pr-number "${PR_NUMBER}" \ 2024-04-03T11:41:30.3548741Z ##[debug] --tag "${TAG}" \ 2024-04-03T11:41:30.3549474Z ##[debug] --event-name "${EVENT_NAME}" \ 2024-04-03T11:41:30.3550120Z ##[debug] --schedule "${SCHEDULE}" \ 2024-04-03T11:41:30.3550734Z ##[debug] --branch "${HEAD_BRANCH}" 2024-04-03T11:41:30.3551302Z ##[debug]' 2024-04-03T11:41:30.3551894Z ##[debug]Result: 'echo "Workflow: ${GITHUB_WORKFLOW}" 2024-04-03T11:41:30.3552621Z ##[debug]echo "Job name: ${JOB_NAME}" 2024-04-03T11:41:30.3553169Z ##[debug] 2024-04-03T11:41:30.3553812Z ##[debug]# Use relative path here as this could be checked out anywhere, not necessarily 2024-04-03T11:41:30.3554626Z ##[debug]# in runner workspace 2024-04-03T11:41:30.3555304Z ##[debug]python3 "${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2024-04-03T11:41:30.3556069Z ##[debug] --workflow "${GITHUB_WORKFLOW}" \ 2024-04-03T11:41:30.3556628Z ##[debug] --job-name "${JOB_NAME}" \ 2024-04-03T11:41:30.3557532Z ##[debug] --test-matrix "{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}" \ 2024-04-03T11:41:30.3558475Z ##[debug] --pr-number "${PR_NUMBER}" \ 2024-04-03T11:41:30.3558984Z ##[debug] --tag "${TAG}" \ 2024-04-03T11:41:30.3559451Z ##[debug] --event-name "${EVENT_NAME}" \ 2024-04-03T11:41:30.3559991Z ##[debug] --schedule "${SCHEDULE}" \ 2024-04-03T11:41:30.3560503Z ##[debug] --branch "${HEAD_BRANCH}" 2024-04-03T11:41:30.3560990Z ##[debug]' 2024-04-03T11:41:30.3561434Z ##[debug]Loading env 2024-04-03T11:41:30.3567840Z ##[group]Run echo "Workflow: ${GITHUB_WORKFLOW}" 2024-04-03T11:41:30.3568473Z echo "Workflow: ${GITHUB_WORKFLOW}" 2024-04-03T11:41:30.3569035Z echo "Job name: ${JOB_NAME}" 
2024-04-03T11:41:30.3569519Z  2024-04-03T11:41:30.3570192Z # Use relative path here as this could be checked out anywhere, not necessarily 2024-04-03T11:41:30.3571034Z # in runner workspace 2024-04-03T11:41:30.3571734Z python3 "${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2024-04-03T11:41:30.3572516Z  --workflow "${GITHUB_WORKFLOW}" \ 2024-04-03T11:41:30.3573081Z  --job-name "${JOB_NAME}" \ 2024-04-03T11:41:30.3574005Z  --test-matrix "{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}" \ 2024-04-03T11:41:30.3574951Z  --pr-number "${PR_NUMBER}" \ 2024-04-03T11:41:30.3575477Z  --tag "${TAG}" \ 2024-04-03T11:41:30.3576069Z  --event-name "${EVENT_NAME}" \ 2024-04-03T11:41:30.3576626Z  --schedule "${SCHEDULE}" \ 2024-04-03T11:41:30.3577153Z  --branch "${HEAD_BRANCH}" 2024-04-03T11:41:30.3612294Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:41:30.3612911Z env: 2024-04-03T11:41:30.3613258Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:30.3613765Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:30.3614693Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:30.3615574Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:30.3616051Z AWS_REGION: us-east-1 2024-04-03T11:41:30.3616575Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:30.3617189Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:30.3626347Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:30.3627071Z GITHUB_TOKEN: *** 2024-04-03T11:41:30.3627849Z JOB_NAME: linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:41:30.3628747Z PR_NUMBER: 2024-04-03T11:41:30.3629213Z TAG: ciflow/trunk/123154 2024-04-03T11:41:30.3629729Z EVENT_NAME: push 2024-04-03T11:41:30.3630180Z SCHEDULE: 2024-04-03T11:41:30.3630611Z HEAD_BRANCH: 2024-04-03T11:41:30.3631046Z ##[endgroup] 2024-04-03T11:41:30.3658437Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail 
/home/pytorchci/actions-runner/_work/_temp/c76f1e48-004b-4b57-9806-26813ae0fbfd.sh 2024-04-03T11:41:30.3690249Z Workflow: trunk 2024-04-03T11:41:30.3692640Z Job name: linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:41:30.7857237Z INFO:root:Found no test-config label on the PR, so all test configs are included 2024-04-03T11:41:31.5286717Z fatal: unknown commit main 2024-04-03T11:41:31.5306895Z /home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/filter-test-configs/../../scripts/filter_test_configs.py:458: UserWarning: failed to get commit messages: Command '['git', 'cherry', '-v', 'main']' returned non-zero exit status 128. 2024-04-03T11:41:31.5312185Z warnings.warn(f"failed to get commit messages: {e}") 2024-04-03T11:41:31.5541758Z ##[debug]Set output keep-going = False 2024-04-03T11:41:31.5543363Z ##[debug]Set output ci-verbose-test-logs = False 2024-04-03T11:41:31.5545058Z ##[debug]Set output ci-no-test-timeout = False 2024-04-03T11:41:31.5546633Z ##[debug]Set output ci-no-td = False 2024-04-03T11:41:31.5548230Z ##[debug]Set output is-unstable = False 2024-04-03T11:41:31.5549741Z ##[debug]Set output reenabled-issues = 2024-04-03T11:41:31.5552374Z ##[debug]Set output test-matrix = {"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]} 2024-04-03T11:41:31.5555187Z ##[debug]Set output is-test-matrix-empty = False 2024-04-03T11:41:31.5557007Z ##[debug]Finished: run 2024-04-03T11:41:31.5563170Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T11:41:31.5566020Z ##[debug]Evaluating: success() 2024-04-03T11:41:31.5567424Z ##[debug]Evaluating success: 2024-04-03T11:41:31.5568911Z ##[debug]=> true 2024-04-03T11:41:31.5570262Z ##[debug]Result: true 2024-04-03T11:41:31.5571736Z ##[debug]Starting: run 2024-04-03T11:41:31.5595418Z ##[debug]Loading inputs 2024-04-03T11:41:31.5598775Z ##[debug]Evaluating: format('echo "Filtered matrix:" 2024-04-03T11:41:31.5600055Z ##[debug]echo 
"{0}" 2024-04-03T11:41:31.5600870Z ##[debug] 2024-04-03T11:41:31.5601587Z ##[debug]echo 2024-04-03T11:41:31.5602527Z ##[debug]echo "Is the current job unstable? {1}" 2024-04-03T11:41:31.5603722Z ##[debug] 2024-04-03T11:41:31.5604443Z ##[debug]echo 2024-04-03T11:41:31.5605350Z ##[debug]echo "Is keep-going label set? {2}" 2024-04-03T11:41:31.5606519Z ##[debug] 2024-04-03T11:41:31.5607238Z ##[debug]echo 2024-04-03T11:41:31.5608088Z ##[debug]echo "Renabled issues? {3}" 2024-04-03T11:41:31.5610864Z ##[debug]', steps.filter.outputs.test-matrix, steps.filter.outputs.is-unstable, steps.filter.outputs.keep-going, steps.filter.outputs.reenabled-issues) 2024-04-03T11:41:31.5613674Z ##[debug]Evaluating format: 2024-04-03T11:41:31.5615045Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.5616185Z ##[debug]..=> 'echo "Filtered matrix:" 2024-04-03T11:41:31.5617264Z ##[debug]echo "{0}" 2024-04-03T11:41:31.5618074Z ##[debug] 2024-04-03T11:41:31.5618788Z ##[debug]echo 2024-04-03T11:41:31.5620055Z ##[debug]echo "Is the current job unstable? {1}" 2024-04-03T11:41:31.5621243Z ##[debug] 2024-04-03T11:41:31.5621955Z ##[debug]echo 2024-04-03T11:41:31.5623027Z ##[debug]echo "Is keep-going label set? {2}" 2024-04-03T11:41:31.5624365Z ##[debug] 2024-04-03T11:41:31.5625213Z ##[debug]echo 2024-04-03T11:41:31.5626196Z ##[debug]echo "Renabled issues? 
{3}" 2024-04-03T11:41:31.5627443Z ##[debug]' 2024-04-03T11:41:31.5628648Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5629860Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5631105Z ##[debug]......Evaluating Index: 2024-04-03T11:41:31.5632383Z ##[debug]........Evaluating steps: 2024-04-03T11:41:31.5633676Z ##[debug]........=> Object 2024-04-03T11:41:31.5634897Z ##[debug]........Evaluating String: 2024-04-03T11:41:31.5635773Z ##[debug]........=> 'filter' 2024-04-03T11:41:31.5636313Z ##[debug]......=> Object 2024-04-03T11:41:31.5636842Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5637425Z ##[debug]......=> 'outputs' 2024-04-03T11:41:31.5637955Z ##[debug]....=> Object 2024-04-03T11:41:31.5638470Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5639037Z ##[debug]....=> 'test-matrix' 2024-04-03T11:41:31.5639982Z ##[debug]..=> '{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}' 2024-04-03T11:41:31.5641040Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5641608Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5642492Z ##[debug]......Evaluating Index: 2024-04-03T11:41:31.5643078Z ##[debug]........Evaluating steps: 2024-04-03T11:41:31.5643662Z ##[debug]........=> Object 2024-04-03T11:41:31.5644211Z ##[debug]........Evaluating String: 2024-04-03T11:41:31.5644802Z ##[debug]........=> 'filter' 2024-04-03T11:41:31.5645348Z ##[debug]......=> Object 2024-04-03T11:41:31.5645841Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5646328Z ##[debug]......=> 'outputs' 2024-04-03T11:41:31.5646788Z ##[debug]....=> Object 2024-04-03T11:41:31.5647217Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5647698Z ##[debug]....=> 'is-unstable' 2024-04-03T11:41:31.5648159Z ##[debug]..=> 'False' 2024-04-03T11:41:31.5648598Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5649062Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5649535Z ##[debug]......Evaluating Index: 2024-04-03T11:41:31.5650028Z 
##[debug]........Evaluating steps: 2024-04-03T11:41:31.5650519Z ##[debug]........=> Object 2024-04-03T11:41:31.5650984Z ##[debug]........Evaluating String: 2024-04-03T11:41:31.5651482Z ##[debug]........=> 'filter' 2024-04-03T11:41:31.5651946Z ##[debug]......=> Object 2024-04-03T11:41:31.5652394Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5652877Z ##[debug]......=> 'outputs' 2024-04-03T11:41:31.5653318Z ##[debug]....=> Object 2024-04-03T11:41:31.5653748Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5654236Z ##[debug]....=> 'keep-going' 2024-04-03T11:41:31.5654692Z ##[debug]..=> 'False' 2024-04-03T11:41:31.5655126Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5655597Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5656077Z ##[debug]......Evaluating Index: 2024-04-03T11:41:31.5656566Z ##[debug]........Evaluating steps: 2024-04-03T11:41:31.5657061Z ##[debug]........=> Object 2024-04-03T11:41:31.5657530Z ##[debug]........Evaluating String: 2024-04-03T11:41:31.5658027Z ##[debug]........=> 'filter' 2024-04-03T11:41:31.5658479Z ##[debug]......=> Object 2024-04-03T11:41:31.5658926Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5659419Z ##[debug]......=> 'outputs' 2024-04-03T11:41:31.5659917Z ##[debug]....=> Object 2024-04-03T11:41:31.5660356Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5660850Z ##[debug]....=> 'reenabled-issues' 2024-04-03T11:41:31.5661340Z ##[debug]..=> '' 2024-04-03T11:41:31.5661787Z ##[debug]=> 'echo "Filtered matrix:" 2024-04-03T11:41:31.5662861Z ##[debug]echo "{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}" 2024-04-03T11:41:31.5663865Z ##[debug] 2024-04-03T11:41:31.5664256Z ##[debug]echo 2024-04-03T11:41:31.5664783Z ##[debug]echo "Is the current job unstable? False" 2024-04-03T11:41:31.5665441Z ##[debug] 2024-04-03T11:41:31.5665846Z ##[debug]echo 2024-04-03T11:41:31.5666367Z ##[debug]echo "Is keep-going label set? 
False" 2024-04-03T11:41:31.5667004Z ##[debug] 2024-04-03T11:41:31.5667395Z ##[debug]echo 2024-04-03T11:41:31.5667860Z ##[debug]echo "Renabled issues? " 2024-04-03T11:41:31.5668413Z ##[debug]' 2024-04-03T11:41:31.5668937Z ##[debug]Result: 'echo "Filtered matrix:" 2024-04-03T11:41:31.5669973Z ##[debug]echo "{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}" 2024-04-03T11:41:31.5670966Z ##[debug] 2024-04-03T11:41:31.5671358Z ##[debug]echo 2024-04-03T11:41:31.5671884Z ##[debug]echo "Is the current job unstable? False" 2024-04-03T11:41:31.5672545Z ##[debug] 2024-04-03T11:41:31.5672947Z ##[debug]echo 2024-04-03T11:41:31.5673454Z ##[debug]echo "Is keep-going label set? False" 2024-04-03T11:41:31.5674087Z ##[debug] 2024-04-03T11:41:31.5674483Z ##[debug]echo 2024-04-03T11:41:31.5674942Z ##[debug]echo "Renabled issues? " 2024-04-03T11:41:31.5675488Z ##[debug]' 2024-04-03T11:41:31.5675949Z ##[debug]Loading env 2024-04-03T11:41:31.5682355Z ##[group]Run echo "Filtered matrix:" 2024-04-03T11:41:31.5682892Z echo "Filtered matrix:" 2024-04-03T11:41:31.5683741Z echo "{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}" 2024-04-03T11:41:31.5684677Z  2024-04-03T11:41:31.5685022Z echo 2024-04-03T11:41:31.5685485Z echo "Is the current job unstable? False" 2024-04-03T11:41:31.5686041Z  2024-04-03T11:41:31.5686379Z echo 2024-04-03T11:41:31.5686825Z echo "Is keep-going label set? False" 2024-04-03T11:41:31.5687368Z  2024-04-03T11:41:31.5687715Z echo 2024-04-03T11:41:31.5688107Z echo "Renabled issues? 
" 2024-04-03T11:41:31.5723096Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:41:31.5723716Z env: 2024-04-03T11:41:31.5724070Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:31.5724587Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:31.5725515Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:31.5726408Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:31.5726893Z AWS_REGION: us-east-1 2024-04-03T11:41:31.5727438Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:31.5728050Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:31.5736080Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:31.5736501Z ##[endgroup] 2024-04-03T11:41:31.5764240Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/1f96d01e-b12f-4710-9b35-b778020b457e.sh 2024-04-03T11:41:31.5792825Z Filtered matrix: 2024-04-03T11:41:31.5794657Z {include: [{config: default, shard: 1, num_shards: 1, runner: linux.rocm.gpu}]} 2024-04-03T11:41:31.5796375Z 2024-04-03T11:41:31.5796854Z Is the current job unstable? False 2024-04-03T11:41:31.5797774Z 2024-04-03T11:41:31.5798650Z Is keep-going label set? False 2024-04-03T11:41:31.5799485Z 2024-04-03T11:41:31.5799903Z Renabled issues? 
2024-04-03T11:41:31.5807971Z ##[debug]Finished: run 2024-04-03T11:41:31.5814179Z ##[debug]Evaluating: steps.filter.outputs.test-matrix 2024-04-03T11:41:31.5815605Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.5816629Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5817704Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5818774Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.5820849Z ##[debug]......=> Object 2024-04-03T11:41:31.5821874Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5823045Z ##[debug]......=> 'filter' 2024-04-03T11:41:31.5824203Z ##[debug]....=> Object 2024-04-03T11:41:31.5825325Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5826927Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.5828073Z ##[debug]..=> Object 2024-04-03T11:41:31.5829158Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.5830380Z ##[debug]..=> 'test-matrix' 2024-04-03T11:41:31.5832469Z ##[debug]=> '{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}' 2024-04-03T11:41:31.5835711Z ##[debug]Result: '{"include": [{"config": "default", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}' 2024-04-03T11:41:31.5839341Z ##[debug]Evaluating: steps.filter.outputs.is-test-matrix-empty 2024-04-03T11:41:31.5841232Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.5842511Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5843794Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5845130Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.5846454Z ##[debug]......=> Object 2024-04-03T11:41:31.5847699Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5849035Z ##[debug]......=> 'filter' 2024-04-03T11:41:31.5850257Z ##[debug]....=> Object 2024-04-03T11:41:31.5851436Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5852735Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.5853911Z ##[debug]..=> Object 2024-04-03T11:41:31.5855056Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.5856412Z ##[debug]..=> 
'is-test-matrix-empty' 2024-04-03T11:41:31.5857819Z ##[debug]=> 'False' 2024-04-03T11:41:31.5858958Z ##[debug]Result: 'False' 2024-04-03T11:41:31.5861488Z ##[debug]Evaluating: steps.filter.outputs.keep-going 2024-04-03T11:41:31.5863207Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.5864481Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5866217Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5867546Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.5868901Z ##[debug]......=> Object 2024-04-03T11:41:31.5870155Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5871504Z ##[debug]......=> 'filter' 2024-04-03T11:41:31.5872735Z ##[debug]....=> Object 2024-04-03T11:41:31.5873922Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5875252Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.5876187Z ##[debug]..=> Object 2024-04-03T11:41:31.5876682Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.5877156Z ##[debug]..=> 'keep-going' 2024-04-03T11:41:31.5877599Z ##[debug]=> 'False' 2024-04-03T11:41:31.5878012Z ##[debug]Result: 'False' 2024-04-03T11:41:31.5878829Z ##[debug]Evaluating: steps.filter.outputs.reenabled-issues 2024-04-03T11:41:31.5879486Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.5879943Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5880410Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5880891Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.5881391Z ##[debug]......=> Object 2024-04-03T11:41:31.5881846Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5882337Z ##[debug]......=> 'filter' 2024-04-03T11:41:31.5882775Z ##[debug]....=> Object 2024-04-03T11:41:31.5883214Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5883696Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.5884133Z ##[debug]..=> Object 2024-04-03T11:41:31.5884564Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.5885064Z ##[debug]..=> 'reenabled-issues' 2024-04-03T11:41:31.5885548Z ##[debug]=> '' 2024-04-03T11:41:31.5885944Z ##[debug]Result: '' 
2024-04-03T11:41:31.5886736Z ##[debug]Evaluating: steps.filter.outputs.ci-verbose-test-logs 2024-04-03T11:41:31.5887420Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.5887882Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5888356Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5888838Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.5889327Z ##[debug]......=> Object 2024-04-03T11:41:31.5889806Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5890300Z ##[debug]......=> 'filter' 2024-04-03T11:41:31.5890754Z ##[debug]....=> Object 2024-04-03T11:41:31.5891188Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5891671Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.5892112Z ##[debug]..=> Object 2024-04-03T11:41:31.5892538Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.5893155Z ##[debug]..=> 'ci-verbose-test-logs' 2024-04-03T11:41:31.5893681Z ##[debug]=> 'False' 2024-04-03T11:41:31.5894101Z ##[debug]Result: 'False' 2024-04-03T11:41:31.5894940Z ##[debug]Evaluating: steps.filter.outputs.ci-no-test-timeout 2024-04-03T11:41:31.5895598Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.5896054Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5896524Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5897001Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.5897484Z ##[debug]......=> Object 2024-04-03T11:41:31.5897939Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5898431Z ##[debug]......=> 'filter' 2024-04-03T11:41:31.5898878Z ##[debug]....=> Object 2024-04-03T11:41:31.5899320Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5899864Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.5900308Z ##[debug]..=> Object 2024-04-03T11:41:31.5900734Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.5901228Z ##[debug]..=> 'ci-no-test-timeout' 2024-04-03T11:41:31.5901730Z ##[debug]=> 'False' 2024-04-03T11:41:31.5902151Z ##[debug]Result: 'False' 2024-04-03T11:41:31.5902996Z ##[debug]Evaluating: steps.filter.outputs.ci-no-td 2024-04-03T11:41:31.5903708Z 
##[debug]Evaluating Index: 2024-04-03T11:41:31.5904249Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5904822Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.5905405Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.5905988Z ##[debug]......=> Object 2024-04-03T11:41:31.5906531Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5907128Z ##[debug]......=> 'filter' 2024-04-03T11:41:31.5907656Z ##[debug]....=> Object 2024-04-03T11:41:31.5908364Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5908938Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.5909456Z ##[debug]..=> Object 2024-04-03T11:41:31.5909972Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.5910525Z ##[debug]..=> 'ci-no-td' 2024-04-03T11:41:31.5911043Z ##[debug]=> 'False' 2024-04-03T11:41:31.5911551Z ##[debug]Result: 'False' 2024-04-03T11:41:31.5915214Z ##[debug]Finishing: Check for keep-going label and re-enabled test issues 2024-04-03T11:41:31.5939184Z ##[debug]Evaluating: (((matrix.mem_leak_check == 'mem_leak_check') && 600) || inputs.timeout-minutes) 2024-04-03T11:41:31.5940149Z ##[debug]Evaluating Or: 2024-04-03T11:41:31.5940653Z ##[debug]..Evaluating And: 2024-04-03T11:41:31.5941196Z ##[debug]....Evaluating Equal: 2024-04-03T11:41:31.5941764Z ##[debug]......Evaluating Index: 2024-04-03T11:41:31.5942346Z ##[debug]........Evaluating matrix: 2024-04-03T11:41:31.5942970Z ##[debug]........=> Object 2024-04-03T11:41:31.5943526Z ##[debug]........Evaluating String: 2024-04-03T11:41:31.5944172Z ##[debug]........=> 'mem_leak_check' 2024-04-03T11:41:31.5944799Z ##[debug]......=> null 2024-04-03T11:41:31.5945319Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.5945909Z ##[debug]......=> 'mem_leak_check' 2024-04-03T11:41:31.5947887Z ##[debug]....=> false 2024-04-03T11:41:31.5948409Z ##[debug]..=> false 2024-04-03T11:41:31.5948904Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.5949463Z ##[debug]....Evaluating inputs: 2024-04-03T11:41:31.5950114Z ##[debug]....=> Object 
2024-04-03T11:41:31.5950645Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.5951231Z ##[debug]....=> 'timeout-minutes' 2024-04-03T11:41:31.5952659Z ##[debug]..=> 300 2024-04-03T11:41:31.5953194Z ##[debug]=> 300 2024-04-03T11:41:31.5953908Z ##[debug]Expanded: (((null == 'mem_leak_check') && 600) || 300) 2024-04-03T11:41:31.5954668Z ##[debug]Result: 300 2024-04-03T11:41:31.5955532Z ##[debug]Evaluating condition for step: 'Set Test step time' 2024-04-03T11:41:31.5957016Z ##[debug]Evaluating: success() 2024-04-03T11:41:31.5957649Z ##[debug]Evaluating success: 2024-04-03T11:41:31.5958297Z ##[debug]=> true 2024-04-03T11:41:31.5958883Z ##[debug]Result: true 2024-04-03T11:41:31.5959620Z ##[debug]Starting: Set Test step time 2024-04-03T11:41:31.5970237Z ##[debug]Loading inputs 2024-04-03T11:41:31.5971299Z ##[debug]Loading env 2024-04-03T11:41:31.5977774Z ##[group]Run echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2024-04-03T11:41:31.5978622Z echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2024-04-03T11:41:31.6013683Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T11:41:31.6014290Z env: 2024-04-03T11:41:31.6014639Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:31.6015252Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:31.6016417Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:31.6017385Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:31.6017906Z AWS_REGION: us-east-1 2024-04-03T11:41:31.6018482Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:31.6019146Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:31.6029709Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:31.6030232Z JOB_TIMEOUT: 300 2024-04-03T11:41:31.6030686Z ##[endgroup] 2024-04-03T11:41:31.6058824Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/2390dd44-f5cd-4dac-bcba-8db21a37f670.sh 2024-04-03T11:41:31.6103245Z ##[debug]Set output timeout = 270 
2024-04-03T11:41:31.6105265Z ##[debug]Finishing: Set Test step time 2024-04-03T11:41:31.6129525Z ##[debug]Evaluating: inputs.build-environment 2024-04-03T11:41:31.6131104Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6132306Z ##[debug]..Evaluating inputs: 2024-04-03T11:41:31.6133536Z ##[debug]..=> Object 2024-04-03T11:41:31.6134645Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6135923Z ##[debug]..=> 'build-environment' 2024-04-03T11:41:31.6137334Z ##[debug]=> 'linux-focal-rocm6.0-py3.8' 2024-04-03T11:41:31.6138891Z ##[debug]Result: 'linux-focal-rocm6.0-py3.8' 2024-04-03T11:41:31.6141751Z ##[debug]Evaluating: github.event.pull_request.number 2024-04-03T11:41:31.6143359Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6144545Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6145755Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6147000Z ##[debug]......Evaluating github: 2024-04-03T11:41:31.6148269Z ##[debug]......=> Object 2024-04-03T11:41:31.6149430Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6150684Z ##[debug]......=> 'event' 2024-04-03T11:41:31.6151908Z ##[debug]....=> Object 2024-04-03T11:41:31.6153019Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6154277Z ##[debug]....=> 'pull_request' 2024-04-03T11:41:31.6155464Z ##[debug]..=> null 2024-04-03T11:41:31.6156465Z ##[debug]=> null 2024-04-03T11:41:31.6157474Z ##[debug]Result: null 2024-04-03T11:41:31.6158982Z ##[debug]Evaluating: github.repository 2024-04-03T11:41:31.6160342Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6161518Z ##[debug]..Evaluating github: 2024-04-03T11:41:31.6162729Z ##[debug]..=> Object 2024-04-03T11:41:31.6163817Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6165027Z ##[debug]..=> 'repository' 2024-04-03T11:41:31.6166226Z ##[debug]=> 'pytorch/pytorch' 2024-04-03T11:41:31.6167528Z ##[debug]Result: 'pytorch/pytorch' 2024-04-03T11:41:31.6169168Z ##[debug]Evaluating: github.workflow 2024-04-03T11:41:31.6170501Z ##[debug]Evaluating Index: 
2024-04-03T11:41:31.6171680Z ##[debug]..Evaluating github: 2024-04-03T11:41:31.6172885Z ##[debug]..=> Object 2024-04-03T11:41:31.6173989Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6175183Z ##[debug]..=> 'workflow' 2024-04-03T11:41:31.6176298Z ##[debug]=> 'trunk' 2024-04-03T11:41:31.6177344Z ##[debug]Result: 'trunk' 2024-04-03T11:41:31.6178764Z ##[debug]Evaluating: github.job 2024-04-03T11:41:31.6180383Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6181553Z ##[debug]..Evaluating github: 2024-04-03T11:41:31.6182741Z ##[debug]..=> Object 2024-04-03T11:41:31.6183823Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6185017Z ##[debug]..=> 'job' 2024-04-03T11:41:31.6185975Z ##[debug]=> 'test' 2024-04-03T11:41:31.6186455Z ##[debug]Result: 'test' 2024-04-03T11:41:31.6187048Z ##[debug]Evaluating: github.run_id 2024-04-03T11:41:31.6187536Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6187978Z ##[debug]..Evaluating github: 2024-04-03T11:41:31.6188421Z ##[debug]..=> Object 2024-04-03T11:41:31.6188829Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6189270Z ##[debug]..=> 'run_id' 2024-04-03T11:41:31.6189682Z ##[debug]=> '8536952077' 2024-04-03T11:41:31.6190135Z ##[debug]Result: '8536952077' 2024-04-03T11:41:31.6190703Z ##[debug]Evaluating: github.run_number 2024-04-03T11:41:31.6191206Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6191645Z ##[debug]..Evaluating github: 2024-04-03T11:41:31.6192094Z ##[debug]..=> Object 2024-04-03T11:41:31.6192510Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6192957Z ##[debug]..=> 'run_number' 2024-04-03T11:41:31.6193381Z ##[debug]=> '71645' 2024-04-03T11:41:31.6193776Z ##[debug]Result: '71645' 2024-04-03T11:41:31.6194330Z ##[debug]Evaluating: github.run_attempt 2024-04-03T11:41:31.6194843Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6195288Z ##[debug]..Evaluating github: 2024-04-03T11:41:31.6195731Z ##[debug]..=> Object 2024-04-03T11:41:31.6196137Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6196590Z ##[debug]..=> 
'run_attempt' 2024-04-03T11:41:31.6197022Z ##[debug]=> '1' 2024-04-03T11:41:31.6197394Z ##[debug]Result: '1' 2024-04-03T11:41:31.6198183Z ##[debug]Evaluating: steps.get-job-id.outputs.job-id 2024-04-03T11:41:31.6198801Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6199240Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6199689Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6200148Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.6200625Z ##[debug]......=> Object 2024-04-03T11:41:31.6201058Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6201535Z ##[debug]......=> 'get-job-id' 2024-04-03T11:41:31.6201980Z ##[debug]....=> Object 2024-04-03T11:41:31.6202509Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6202972Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.6203387Z ##[debug]..=> Object 2024-04-03T11:41:31.6203796Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6204240Z ##[debug]..=> 'job-id' 2024-04-03T11:41:31.6204648Z ##[debug]=> '23389745826' 2024-04-03T11:41:31.6205081Z ##[debug]Result: '23389745826' 2024-04-03T11:41:31.6205786Z ##[debug]Evaluating: steps.get-job-id.outputs.job-name 2024-04-03T11:41:31.6206401Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6206856Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6207315Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6207783Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.6208260Z ##[debug]......=> Object 2024-04-03T11:41:31.6208698Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6209184Z ##[debug]......=> 'get-job-id' 2024-04-03T11:41:31.6209632Z ##[debug]....=> Object 2024-04-03T11:41:31.6210056Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6210525Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.6210942Z ##[debug]..=> Object 2024-04-03T11:41:31.6211363Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6211810Z ##[debug]..=> 'job-name' 2024-04-03T11:41:31.6212478Z ##[debug]=> 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 
2024-04-03T11:41:31.6213491Z ##[debug]Result: 'linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)' 2024-04-03T11:41:31.6214491Z ##[debug]Evaluating: steps.parse-ref.outputs.branch 2024-04-03T11:41:31.6215077Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6215527Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6215982Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6216443Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.6216910Z ##[debug]......=> Object 2024-04-03T11:41:31.6217345Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6217853Z ##[debug]......=> 'parse-ref' 2024-04-03T11:41:31.6218301Z ##[debug]....=> Object 2024-04-03T11:41:31.6218721Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6219196Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.6219696Z ##[debug]..=> Object 2024-04-03T11:41:31.6220109Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6220550Z ##[debug]..=> 'branch' 2024-04-03T11:41:31.6220945Z ##[debug]=> null 2024-04-03T11:41:31.6221327Z ##[debug]Result: null 2024-04-03T11:41:31.6222177Z ##[debug]Evaluating: (github.event.pull_request.head.sha || github.sha) 2024-04-03T11:41:31.6222882Z ##[debug]Evaluating Or: 2024-04-03T11:41:31.6223312Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6223767Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6224232Z ##[debug]......Evaluating Index: 2024-04-03T11:41:31.6224712Z ##[debug]........Evaluating Index: 2024-04-03T11:41:31.6225211Z ##[debug]..........Evaluating github: 2024-04-03T11:41:31.6225730Z ##[debug]..........=> Object 2024-04-03T11:41:31.6226192Z ##[debug]..........Evaluating String: 2024-04-03T11:41:31.6226684Z ##[debug]..........=> 'event' 2024-04-03T11:41:31.6227125Z ##[debug]........=> Object 2024-04-03T11:41:31.6227580Z ##[debug]........Evaluating String: 2024-04-03T11:41:31.6228085Z ##[debug]........=> 'pull_request' 2024-04-03T11:41:31.6228560Z ##[debug]......=> null 2024-04-03T11:41:31.6228956Z ##[debug]....=> null 2024-04-03T11:41:31.6229342Z 
##[debug]..=> null 2024-04-03T11:41:31.6229735Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6230183Z ##[debug]....Evaluating github: 2024-04-03T11:41:31.6230638Z ##[debug]....=> Object 2024-04-03T11:41:31.6231048Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6231505Z ##[debug]....=> 'sha' 2024-04-03T11:41:31.6232199Z ##[debug]..=> 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:41:31.6232917Z ##[debug]=> 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:41:31.6233714Z ##[debug]Expanded: (null || 'd253bd7befc64137de913dbb8727b511f1052411') 2024-04-03T11:41:31.6234512Z ##[debug]Result: 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T11:41:31.6235460Z ##[debug]Evaluating: steps.keep-going.outputs.keep-going 2024-04-03T11:41:31.6236199Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6236644Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6237104Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6237564Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.6238033Z ##[debug]......=> Object 2024-04-03T11:41:31.6238465Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6238943Z ##[debug]......=> 'keep-going' 2024-04-03T11:41:31.6239404Z ##[debug]....=> Object 2024-04-03T11:41:31.6239821Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6240289Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.6240706Z ##[debug]..=> Object 2024-04-03T11:41:31.6241109Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6241562Z ##[debug]..=> 'keep-going' 2024-04-03T11:41:31.6241995Z ##[debug]=> 'False' 2024-04-03T11:41:31.6242395Z ##[debug]Result: 'False' 2024-04-03T11:41:31.6243152Z ##[debug]Evaluating: steps.keep-going.outputs.ci-verbose-test-logs 2024-04-03T11:41:31.6243841Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6244286Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6244745Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6245230Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.6245704Z ##[debug]......=> Object 
2024-04-03T11:41:31.6246148Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6246631Z ##[debug]......=> 'keep-going' 2024-04-03T11:41:31.6247082Z ##[debug]....=> Object 2024-04-03T11:41:31.6247504Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6247969Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.6248388Z ##[debug]..=> Object 2024-04-03T11:41:31.6248801Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6249308Z ##[debug]..=> 'ci-verbose-test-logs' 2024-04-03T11:41:31.6249807Z ##[debug]=> 'False' 2024-04-03T11:41:31.6250204Z ##[debug]Result: 'False' 2024-04-03T11:41:31.6250945Z ##[debug]Evaluating: steps.keep-going.outputs.ci-no-test-timeout 2024-04-03T11:41:31.6251648Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6252097Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6252577Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6253063Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.6253555Z ##[debug]......=> Object 2024-04-03T11:41:31.6254001Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6254495Z ##[debug]......=> 'keep-going' 2024-04-03T11:41:31.6254954Z ##[debug]....=> Object 2024-04-03T11:41:31.6255391Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6255879Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.6256316Z ##[debug]..=> Object 2024-04-03T11:41:31.6256747Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6257254Z ##[debug]..=> 'ci-no-test-timeout' 2024-04-03T11:41:31.6257749Z ##[debug]=> 'False' 2024-04-03T11:41:31.6258158Z ##[debug]Result: 'False' 2024-04-03T11:41:31.6258819Z ##[debug]Evaluating: steps.keep-going.outputs.ci-no-td 2024-04-03T11:41:31.6259430Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6259958Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6260411Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6260879Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.6261357Z ##[debug]......=> Object 2024-04-03T11:41:31.6261813Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6262299Z 
##[debug]......=> 'keep-going' 2024-04-03T11:41:31.6262750Z ##[debug]....=> Object 2024-04-03T11:41:31.6263177Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6263658Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.6264082Z ##[debug]..=> Object 2024-04-03T11:41:31.6264496Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6264948Z ##[debug]..=> 'ci-no-td' 2024-04-03T11:41:31.6265371Z ##[debug]=> 'False' 2024-04-03T11:41:31.6265903Z ##[debug]Result: 'False' 2024-04-03T11:41:31.6266502Z ##[debug]Evaluating: matrix.config 2024-04-03T11:41:31.6267002Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6267446Z ##[debug]..Evaluating matrix: 2024-04-03T11:41:31.6267896Z ##[debug]..=> Object 2024-04-03T11:41:31.6268309Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6268759Z ##[debug]..=> 'config' 2024-04-03T11:41:31.6269184Z ##[debug]=> 'default' 2024-04-03T11:41:31.6269597Z ##[debug]Result: 'default' 2024-04-03T11:41:31.6270292Z ##[debug]Evaluating: matrix.shard 2024-04-03T11:41:31.6270786Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6271231Z ##[debug]..Evaluating matrix: 2024-04-03T11:41:31.6271682Z ##[debug]..=> Object 2024-04-03T11:41:31.6272103Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6272548Z ##[debug]..=> 'shard' 2024-04-03T11:41:31.6273233Z ##[debug]=> 1 2024-04-03T11:41:31.6273617Z ##[debug]Result: 1 2024-04-03T11:41:31.6274180Z ##[debug]Evaluating: matrix.num_shards 2024-04-03T11:41:31.6274743Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6275217Z ##[debug]..Evaluating matrix: 2024-04-03T11:41:31.6275690Z ##[debug]..=> Object 2024-04-03T11:41:31.6276119Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6276583Z ##[debug]..=> 'num_shards' 2024-04-03T11:41:31.6277048Z ##[debug]=> 1 2024-04-03T11:41:31.6277427Z ##[debug]Result: 1 2024-04-03T11:41:31.6278133Z ##[debug]Evaluating: steps.keep-going.outputs.reenabled-issues 2024-04-03T11:41:31.6278813Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6279270Z ##[debug]..Evaluating Index: 
2024-04-03T11:41:31.6279728Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6280195Z ##[debug]......Evaluating steps: 2024-04-03T11:41:31.6280696Z ##[debug]......=> Object 2024-04-03T11:41:31.6281137Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6281617Z ##[debug]......=> 'keep-going' 2024-04-03T11:41:31.6282072Z ##[debug]....=> Object 2024-04-03T11:41:31.6282490Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6282958Z ##[debug]....=> 'outputs' 2024-04-03T11:41:31.6283385Z ##[debug]..=> Object 2024-04-03T11:41:31.6283795Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6284270Z ##[debug]..=> 'reenabled-issues' 2024-04-03T11:41:31.6284737Z ##[debug]=> '' 2024-04-03T11:41:31.6285112Z ##[debug]Result: '' 2024-04-03T11:41:31.6285660Z ##[debug]Evaluating: inputs.docker-image 2024-04-03T11:41:31.6286189Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6286635Z ##[debug]..Evaluating inputs: 2024-04-03T11:41:31.6287096Z ##[debug]..=> Object 2024-04-03T11:41:31.6287514Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6287981Z ##[debug]..=> 'docker-image' 2024-04-03T11:41:31.6289062Z ##[debug]=> '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:41:31.6290853Z ##[debug]Result: '308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49' 2024-04-03T11:41:31.6292325Z ##[debug]Evaluating: ((matrix.mem_leak_check && '1') || '0') 2024-04-03T11:41:31.6292960Z ##[debug]Evaluating Or: 2024-04-03T11:41:31.6293395Z ##[debug]..Evaluating And: 2024-04-03T11:41:31.6293846Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6294318Z ##[debug]......Evaluating matrix: 2024-04-03T11:41:31.6294799Z ##[debug]......=> Object 2024-04-03T11:41:31.6295241Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6295736Z ##[debug]......=> 'mem_leak_check' 2024-04-03T11:41:31.6296214Z ##[debug]....=> null 
2024-04-03T11:41:31.6296616Z ##[debug]..=> null 2024-04-03T11:41:31.6297015Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6297463Z ##[debug]..=> '0' 2024-04-03T11:41:31.6297841Z ##[debug]=> '0' 2024-04-03T11:41:31.6298298Z ##[debug]Expanded: ((null && '1') || '0') 2024-04-03T11:41:31.6298810Z ##[debug]Result: '0' 2024-04-03T11:41:31.6299535Z ##[debug]Evaluating: ((matrix.rerun_disabled_tests && '1') || '0') 2024-04-03T11:41:31.6300361Z ##[debug]Evaluating Or: 2024-04-03T11:41:31.6300787Z ##[debug]..Evaluating And: 2024-04-03T11:41:31.6301371Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6301850Z ##[debug]......Evaluating matrix: 2024-04-03T11:41:31.6302332Z ##[debug]......=> Object 2024-04-03T11:41:31.6302771Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6303276Z ##[debug]......=> 'rerun_disabled_tests' 2024-04-03T11:41:31.6303786Z ##[debug]....=> null 2024-04-03T11:41:31.6304175Z ##[debug]..=> null 2024-04-03T11:41:31.6304579Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6305019Z ##[debug]..=> '0' 2024-04-03T11:41:31.6305494Z ##[debug]=> '0' 2024-04-03T11:41:31.6305935Z ##[debug]Expanded: ((null && '1') || '0') 2024-04-03T11:41:31.6306440Z ##[debug]Result: '0' 2024-04-03T11:41:31.6307037Z ##[debug]Evaluating: inputs.tests-to-include 2024-04-03T11:41:31.6307588Z ##[debug]Evaluating Index: 2024-04-03T11:41:31.6308039Z ##[debug]..Evaluating inputs: 2024-04-03T11:41:31.6308491Z ##[debug]..=> Object 2024-04-03T11:41:31.6308905Z ##[debug]..Evaluating String: 2024-04-03T11:41:31.6309374Z ##[debug]..=> 'tests-to-include' 2024-04-03T11:41:31.6310373Z ##[debug]=> 'test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor' 2024-04-03T11:41:31.6311934Z ##[debug]Result: 'test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor' 2024-04-03T11:41:31.6313222Z ##[debug]Evaluating condition for step: 'Test' 2024-04-03T11:41:31.6314485Z 
##[debug]Evaluating: success() 2024-04-03T11:41:31.6315097Z ##[debug]Evaluating success: 2024-04-03T11:41:31.6315727Z ##[debug]=> true 2024-04-03T11:41:31.6316296Z ##[debug]Result: true 2024-04-03T11:41:31.6316965Z ##[debug]Starting: Test 2024-04-03T11:41:31.6319696Z ##[debug]Evaluating: fromJson(steps.test-timeout.outputs.timeout) 2024-04-03T11:41:31.6320377Z ##[debug]Evaluating fromJson: 2024-04-03T11:41:31.6324063Z ##[debug]..Evaluating Index: 2024-04-03T11:41:31.6324526Z ##[debug]....Evaluating Index: 2024-04-03T11:41:31.6324989Z ##[debug]......Evaluating Index: 2024-04-03T11:41:31.6325475Z ##[debug]........Evaluating steps: 2024-04-03T11:41:31.6325970Z ##[debug]........=> Object 2024-04-03T11:41:31.6326424Z ##[debug]........Evaluating String: 2024-04-03T11:41:31.6326922Z ##[debug]........=> 'test-timeout' 2024-04-03T11:41:31.6327415Z ##[debug]......=> Object 2024-04-03T11:41:31.6327848Z ##[debug]......Evaluating String: 2024-04-03T11:41:31.6328323Z ##[debug]......=> 'outputs' 2024-04-03T11:41:31.6328763Z ##[debug]....=> Object 2024-04-03T11:41:31.6329179Z ##[debug]....Evaluating String: 2024-04-03T11:41:31.6329647Z ##[debug]....=> 'timeout' 2024-04-03T11:41:31.6330067Z ##[debug]..=> '270' 2024-04-03T11:41:31.6347846Z ##[debug]=> 270 2024-04-03T11:41:31.6348259Z ##[debug]Result: 270 2024-04-03T11:41:31.6363557Z ##[debug]Loading inputs 2024-04-03T11:41:31.6364627Z ##[debug]Loading env 2024-04-03T11:41:31.6370827Z ##[group]Run set -x 2024-04-03T11:41:31.6371283Z set -x 2024-04-03T11:41:31.6371637Z  2024-04-03T11:41:31.6372067Z if [[ $TEST_CONFIG == 'multigpu' ]]; then 2024-04-03T11:41:31.6372737Z  TEST_COMMAND=.ci/pytorch/multigpu-test.sh 2024-04-03T11:41:31.6373425Z elif [[ $BUILD_ENVIRONMENT == *onnx* ]]; then 2024-04-03T11:41:31.6374061Z  TEST_COMMAND=.ci/caffe2/test.sh 2024-04-03T11:41:31.6374585Z else 2024-04-03T11:41:31.6375003Z  TEST_COMMAND=.ci/pytorch/test.sh 2024-04-03T11:41:31.6375517Z fi 2024-04-03T11:41:31.6375855Z  2024-04-03T11:41:31.6376420Z # 
detached container should get cleaned up by teardown_ec2_linux 2024-04-03T11:41:31.6377351Z # TODO: Stop building test binaries as part of the build phase 2024-04-03T11:41:31.6378175Z # Used for GPU_FLAG since that doesn't play nice 2024-04-03T11:41:31.6378837Z # shellcheck disable=SC2086,SC2090 2024-04-03T11:41:31.6379397Z container_name=$(docker run \ 2024-04-03T11:41:31.6380068Z  ${GPU_FLAG:-} \ 2024-04-03T11:41:31.6380521Z  -e BUILD_ENVIRONMENT \ 2024-04-03T11:41:31.6381135Z  -e PR_NUMBER \ 2024-04-03T11:41:31.6381582Z  -e GITHUB_ACTIONS \ 2024-04-03T11:41:31.6382057Z  -e GITHUB_REPOSITORY \ 2024-04-03T11:41:31.6382551Z  -e GITHUB_WORKFLOW \ 2024-04-03T11:41:31.6383019Z  -e GITHUB_JOB \ 2024-04-03T11:41:31.6383466Z  -e GITHUB_RUN_ID \ 2024-04-03T11:41:31.6383935Z  -e GITHUB_RUN_NUMBER \ 2024-04-03T11:41:31.6384424Z  -e GITHUB_RUN_ATTEMPT \ 2024-04-03T11:41:31.6385030Z  -e JOB_ID \ 2024-04-03T11:41:31.6385439Z  -e JOB_NAME \ 2024-04-03T11:41:31.6385855Z  -e BRANCH \ 2024-04-03T11:41:31.6386254Z  -e SHA1 \ 2024-04-03T11:41:31.6386673Z  -e AWS_DEFAULT_REGION \ 2024-04-03T11:41:31.6387158Z  -e IN_WHEEL_TEST \ 2024-04-03T11:41:31.6387615Z  -e SHARD_NUMBER \ 2024-04-03T11:41:31.6388062Z  -e TEST_CONFIG \ 2024-04-03T11:41:31.6388512Z  -e NUM_TEST_SHARDS \ 2024-04-03T11:41:31.6389001Z  -e REENABLED_ISSUES \ 2024-04-03T11:41:31.6389507Z  -e CONTINUE_THROUGH_ERROR \ 2024-04-03T11:41:31.6390020Z  -e VERBOSE_TEST_LOGS \ 2024-04-03T11:41:31.6390511Z  -e NO_TEST_TIMEOUT \ 2024-04-03T11:41:31.6390964Z  -e NO_TD \ 2024-04-03T11:41:31.6391425Z  -e MAX_JOBS="$(nproc --ignore=2)" \ 2024-04-03T11:41:31.6391983Z  -e SCCACHE_BUCKET \ 2024-04-03T11:41:31.6392504Z  -e XLA_CLANG_CACHE_S3_BUCKET_NAME \ 2024-04-03T11:41:31.6393129Z  -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK \ 2024-04-03T11:41:31.6393764Z  -e PYTORCH_TEST_RERUN_DISABLED_TESTS \ 2024-04-03T11:41:31.6394342Z  -e TESTS_TO_INCLUDE \ 2024-04-03T11:41:31.6394935Z  --env-file="/tmp/github_env_${GITHUB_RUN_ID}" \ 
2024-04-03T11:41:31.6395580Z  --ulimit stack=10485760:83886080 \ 2024-04-03T11:41:31.6396106Z  --ulimit core=0 \ 2024-04-03T11:41:31.6396636Z  --security-opt seccomp=unconfined \ 2024-04-03T11:41:31.6397205Z  --cap-add=SYS_PTRACE \ 2024-04-03T11:41:31.6397687Z  --shm-size="8g" \ 2024-04-03T11:41:31.6398114Z  --tty \ 2024-04-03T11:41:31.6398495Z  --detach \ 2024-04-03T11:41:31.6398929Z  --name="${container_name}" \ 2024-04-03T11:41:31.6399438Z  --user jenkins \ 2024-04-03T11:41:31.6400030Z  -v "${GITHUB_WORKSPACE}:/var/lib/jenkins/workspace" \ 2024-04-03T11:41:31.6400721Z  -w /var/lib/jenkins/workspace \ 2024-04-03T11:41:31.6401252Z  "${DOCKER_IMAGE}" 2024-04-03T11:41:31.6401675Z ) 2024-04-03T11:41:31.6402087Z # save container name for later step 2024-04-03T11:41:31.6402791Z echo "CONTAINER_NAME=${container_name}" >> "$GITHUB_ENV" 2024-04-03T11:41:31.6404031Z # jenkins user does not have write permission to mounted workspace; work-around by copying within container to jenkins home 2024-04-03T11:41:31.6405679Z docker exec -t "${container_name}" sh -c "cd .. 
&& cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && ${TEST_COMMAND}" 2024-04-03T11:41:31.6439569Z shell: /usr/bin/bash -e {0} 2024-04-03T11:41:31.6439992Z env: 2024-04-03T11:41:31.6440327Z GIT_DEFAULT_BRANCH: main 2024-04-03T11:41:31.6440823Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T11:41:31.6441739Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T11:41:31.6442622Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T11:41:31.6443085Z AWS_REGION: us-east-1 2024-04-03T11:41:31.6443598Z AWS_ACCESS_KEY_ID: *** 2024-04-03T11:41:31.6444197Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T11:41:31.6452129Z AWS_SESSION_TOKEN: *** 2024-04-03T11:41:31.6452619Z BUILD_ENVIRONMENT: linux-focal-rocm6.0-py3.8 2024-04-03T11:41:31.6453158Z PR_NUMBER: 2024-04-03T11:41:31.6453554Z GITHUB_REPOSITORY: pytorch/pytorch 2024-04-03T11:41:31.6454057Z GITHUB_WORKFLOW: trunk 2024-04-03T11:41:31.6454618Z GITHUB_JOB: test 2024-04-03T11:41:31.6455007Z GITHUB_RUN_ID: 8536952077 2024-04-03T11:41:31.6455443Z GITHUB_RUN_NUMBER: 71645 2024-04-03T11:41:31.6455869Z GITHUB_RUN_ATTEMPT: 1 2024-04-03T11:41:31.6456272Z JOB_ID: 23389745826 2024-04-03T11:41:31.6456903Z JOB_NAME: linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:41:31.6457618Z BRANCH: 2024-04-03T11:41:31.6458032Z SHA1: d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:41:31.6458698Z CONTINUE_THROUGH_ERROR: False 2024-04-03T11:41:31.6459173Z VERBOSE_TEST_LOGS: False 2024-04-03T11:41:31.6459601Z NO_TEST_TIMEOUT: False 2024-04-03T11:41:31.6460073Z NO_TD: False 2024-04-03T11:41:31.6460434Z TEST_CONFIG: default 2024-04-03T11:41:31.6460828Z SHARD_NUMBER: 1 2024-04-03T11:41:31.6461201Z NUM_TEST_SHARDS: 1 2024-04-03T11:41:31.6461588Z REENABLED_ISSUES: 2024-04-03T11:41:31.6462073Z SCCACHE_BUCKET: ossci-compiler-cache-circleci-v2 2024-04-03T11:41:31.6463307Z DOCKER_IMAGE: 
308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:41:31.6464659Z XLA_CLANG_CACHE_S3_BUCKET_NAME: ossci-compiler-clang-cache-circleci-xla 2024-04-03T11:41:31.6465395Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK: 0 2024-04-03T11:41:31.6465922Z PYTORCH_TEST_RERUN_DISABLED_TESTS: 0 2024-04-03T11:41:31.6466968Z TESTS_TO_INCLUDE: test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor 2024-04-03T11:41:31.6467993Z ##[endgroup] 2024-04-03T11:41:31.6495054Z ##[debug]/usr/bin/bash -e /home/pytorchci/actions-runner/_work/_temp/ff45bd4b-c3e9-45ee-8fae-4c48a022d321.sh 2024-04-03T11:41:31.6527923Z + [[ default == \m\u\l\t\i\g\p\u ]] 2024-04-03T11:41:31.6529971Z + [[ linux-focal-rocm6.0-py3.8 == *onnx* ]] 2024-04-03T11:41:31.6531480Z + TEST_COMMAND=.ci/pytorch/test.sh 2024-04-03T11:41:31.6541668Z +++ nproc --ignore=2 2024-04-03T11:41:31.6573732Z ++ docker run --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon -e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY -e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER -e GITHUB_RUN_ATTEMPT -e JOB_ID -e JOB_NAME -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION -e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS -e REENABLED_ISSUES -e CONTINUE_THROUGH_ERROR -e VERBOSE_TEST_LOGS -e NO_TEST_TIMEOUT -e NO_TD -e MAX_JOBS=62 -e SCCACHE_BUCKET -e XLA_CLANG_CACHE_S3_BUCKET_NAME -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK -e PYTORCH_TEST_RERUN_DISABLED_TESTS -e TESTS_TO_INCLUDE --env-file=/tmp/github_env_8536952077 --ulimit stack=10485760:83886080 --ulimit core=0 --security-opt seccomp=unconfined --cap-add=SYS_PTRACE --shm-size=8g --tty --detach --name= --user jenkins -v /home/pytorchci/actions-runner/_work/pytorch/pytorch:/var/lib/jenkins/workspace -w /var/lib/jenkins/workspace 
308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:16884143090f148aa56809ceba26eeb699b5dc49 2024-04-03T11:41:33.4626830Z + container_name=48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T11:41:33.4629853Z + echo CONTAINER_NAME=48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T11:41:33.4635224Z + docker exec -t 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 sh -c 'cd .. && cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && .ci/pytorch/test.sh' 2024-04-03T11:41:41.8597014Z Processing ./dist/torch-2.4.0a0+gitd253bd7-cp38-cp38-linux_x86_64.whl 2024-04-03T11:41:42.1707759Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch==2.4.0a0+gitd253bd7) (3.13.1) 2024-04-03T11:41:42.1713156Z Requirement already satisfied: typing-extensions>=4.8.0 in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch==2.4.0a0+gitd253bd7) (4.10.0) 2024-04-03T11:41:42.1719023Z Requirement already satisfied: sympy in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch==2.4.0a0+gitd253bd7) (1.12) 2024-04-03T11:41:42.1723962Z Requirement already satisfied: networkx in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch==2.4.0a0+gitd253bd7) (2.8.8) 2024-04-03T11:41:42.1728849Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch==2.4.0a0+gitd253bd7) (3.1.3) 2024-04-03T11:41:42.1733749Z Requirement already satisfied: fsspec in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch==2.4.0a0+gitd253bd7) (2024.2.0) 2024-04-03T11:41:42.2267747Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from jinja2->torch==2.4.0a0+gitd253bd7) (2.1.5) 2024-04-03T11:41:42.2457995Z Requirement already satisfied: mpmath>=0.19 in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from 
sympy->torch==2.4.0a0+gitd253bd7) (1.3.0) 2024-04-03T11:41:43.1405632Z Installing collected packages: torch 2024-04-03T11:41:53.9205966Z Successfully installed torch-2.4.0a0+gitd253bd7 2024-04-03T11:41:53.9988989Z + echo 'Environment variables:' 2024-04-03T11:41:53.9993546Z Environment variables: 2024-04-03T11:41:53.9994779Z + env 2024-04-03T11:41:54.0010328Z INSTALLED_DB=yes 2024-04-03T11:41:54.0012530Z GITHUB_WORKSPACE=/home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:41:54.0014481Z CONTINUE_THROUGH_ERROR=False 2024-04-03T11:41:54.0015992Z BUILD_ENVIRONMENT=linux-focal-rocm6.0-py3.8 2024-04-03T11:41:54.0017459Z HOSTNAME=48ab1b864422 2024-04-03T11:41:54.0022632Z GITHUB_PATH=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/add_path_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0025439Z GITHUB_ACTION=__self 2024-04-03T11:41:54.0026579Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2024-04-03T11:41:54.0027869Z GITHUB_RUN_NUMBER=71645 2024-04-03T11:41:54.0028981Z TEST_CONFIG=default 2024-04-03T11:41:54.0030079Z GITHUB_REPOSITORY_OWNER_ID=21003710 2024-04-03T11:41:54.0031589Z AWS_DEFAULT_REGION=us-east-1 2024-04-03T11:41:54.0033111Z GITHUB_TRIGGERING_ACTOR=pytorch-bot[bot] 2024-04-03T11:41:54.0034501Z GITHUB_REF_TYPE=tag 2024-04-03T11:41:54.0041211Z *** 2024-04-03T11:41:54.0042196Z GITHUB_REPOSITORY_ID=65600975 2024-04-03T11:41:54.0043445Z GITHUB_ACTIONS=true 2024-04-03T11:41:54.0044658Z SHA1=d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:41:54.0046338Z GITHUB_SHA=d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:41:54.0048826Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk.yml@refs/tags/ciflow/trunk/123154 2024-04-03T11:41:54.0051135Z VERBOSE_TEST_LOGS=False 2024-04-03T11:41:54.0052512Z GITHUB_REF=refs/tags/ciflow/trunk/123154 2024-04-03T11:41:54.0053926Z SHARD_NUMBER=1 2024-04-03T11:41:54.0054935Z GITHUB_REF_PROTECTED=false 2024-04-03T11:41:54.0056098Z HOME=/var/lib/jenkins 2024-04-03T11:41:54.0057287Z 
GITHUB_API_URL=https://api.github.com 2024-04-03T11:41:54.0058732Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2024-04-03T11:41:54.0060363Z LANG=C.UTF-8 2024-04-03T11:41:54.0061368Z NUM_TEST_SHARDS=1 2024-04-03T11:41:54.0064168Z GITHUB_STATE=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/save_state_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0067766Z JOB_NAME=linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:41:54.0069735Z MAGMA_HOME=/opt/rocm/magma 2024-04-03T11:41:54.0072492Z GITHUB_ENV=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_env_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0076194Z GITHUB_EVENT_PATH=/home/pytorchci/actions-runner/_work/_temp/_github_workflow/event.json 2024-04-03T11:41:54.0078397Z GITHUB_EVENT_NAME=push 2024-04-03T11:41:54.0079502Z GITHUB_RUN_ID=8536952077 2024-04-03T11:41:54.0082503Z GITHUB_STEP_SUMMARY=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/step_summary_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0085503Z GITHUB_ACTOR=pytorch-bot[bot] 2024-04-03T11:41:54.0086737Z PR_NUMBER= 2024-04-03T11:41:54.0087658Z GITHUB_RUN_ATTEMPT=1 2024-04-03T11:41:54.0088756Z ANACONDA_PYTHON_VERSION=3.8 2024-04-03T11:41:54.0090763Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2024-04-03T11:41:54.0092414Z TERM=xterm 2024-04-03T11:41:54.0093408Z INSTALLED_VISION=yes 2024-04-03T11:41:54.0094447Z BRANCH= 2024-04-03T11:41:54.0096792Z TESTS_TO_INCLUDE=test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor 2024-04-03T11:41:54.0100952Z GITHUB_ACTION_PATH=/home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2024-04-03T11:41:54.0103746Z GITHUB_SERVER_URL=https://github.com 2024-04-03T11:41:54.0105116Z PYTORCH_ROCM_ARCH=gfx906;gfx90a 2024-04-03T11:41:54.0106352Z REENABLED_ISSUES= 2024-04-03T11:41:54.0107338Z SHLVL=1 2024-04-03T11:41:54.0108222Z 
MAX_JOBS=62 2024-04-03T11:41:54.0109139Z GITHUB_ACTOR_ID=54816060 2024-04-03T11:41:54.0110607Z GITHUB_WORKFLOW_SHA=d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:41:54.0112350Z GITHUB_REF_NAME=ciflow/trunk/123154 2024-04-03T11:41:54.0113682Z ROCM_PATH=/opt/rocm 2024-04-03T11:41:54.0115514Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2024-04-03T11:41:54.0117411Z GITHUB_JOB=test 2024-04-03T11:41:54.0118402Z NO_TEST_TIMEOUT=False 2024-04-03T11:41:54.0119535Z GITHUB_REPOSITORY=pytorch/pytorch 2024-04-03T11:41:54.0120878Z LC_ALL=C.UTF-8 2024-04-03T11:41:54.0121869Z GITHUB_RETENTION_DAYS=90 2024-04-03T11:41:54.0123021Z GITHUB_ACTION_REPOSITORY= 2024-04-03T11:41:54.0127298Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.8/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-04-03T11:41:54.0131713Z GITHUB_BASE_REF= 2024-04-03T11:41:54.0132717Z CI=true 2024-04-03T11:41:54.0133676Z GITHUB_REPOSITORY_OWNER=pytorch 2024-04-03T11:41:54.0134934Z JOB_ID=23389745826 2024-04-03T11:41:54.0135968Z INSTALLED_PROTOBUF=yes 2024-04-03T11:41:54.0137084Z GITHUB_HEAD_REF= 2024-04-03T11:41:54.0138117Z GITHUB_ACTION_REF= 2024-04-03T11:41:54.0139526Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2024-04-03T11:41:54.0141417Z GITHUB_WORKFLOW=trunk 2024-04-03T11:41:54.0142533Z DEBIAN_FRONTEND=noninteractive 2024-04-03T11:41:54.0145537Z GITHUB_OUTPUT=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_output_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0148308Z NO_TD=False 2024-04-03T11:41:54.0149232Z OLDPWD=/var/lib/jenkins 2024-04-03T11:41:54.0150322Z _=/usr/bin/env 2024-04-03T11:41:54.0151859Z ++ python -c 'import site; print(site.getsitepackages()[0])' 2024-04-03T11:41:54.0252291Z + TORCH_INSTALL_DIR=/opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch 2024-04-03T11:41:54.0255226Z + 
TORCH_BIN_DIR=/opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/bin 2024-04-03T11:41:54.0258045Z + TORCH_LIB_DIR=/opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib 2024-04-03T11:41:54.0261075Z + TORCH_TEST_DIR=/opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/test 2024-04-03T11:41:54.0263065Z + BUILD_DIR=build 2024-04-03T11:41:54.0264162Z + BUILD_RENAMED_DIR=build_renamed 2024-04-03T11:41:54.0265461Z + BUILD_BIN_DIR=build/bin 2024-04-03T11:41:54.0266585Z + SHARD_NUMBER=1 2024-04-03T11:41:54.0267582Z + NUM_TEST_SHARDS=1 2024-04-03T11:41:54.0268613Z + export VALGRIND=ON 2024-04-03T11:41:54.0269646Z + VALGRIND=ON 2024-04-03T11:41:54.0270921Z + [[ linux-focal-rocm6.0-py3.8 == *clang9* ]] 2024-04-03T11:41:54.0272330Z + [[ 0 == \1 ]] 2024-04-03T11:41:54.0273300Z + [[ False == \1 ]] 2024-04-03T11:41:54.0274633Z + [[ linux-focal-rocm6.0-py3.8 != *bazel* ]] 2024-04-03T11:41:54.0276176Z ++ realpath build/custom_test_artifacts 2024-04-03T11:41:54.0289843Z + CUSTOM_TEST_ARTIFACT_BUILD_DIR=/var/lib/jenkins/pytorch/build/custom_test_artifacts 2024-04-03T11:41:54.0293788Z + [[ -n test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor ]] 2024-04-03T11:41:54.0296635Z + echo 'Setting INCLUDE_CLAUSE' 2024-04-03T11:41:54.0297899Z Setting INCLUDE_CLAUSE 2024-04-03T11:41:54.0301650Z + INCLUDE_CLAUSE='--include test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor' 2024-04-03T11:41:54.0304800Z ++ dirname .ci/pytorch/test.sh 2024-04-03T11:41:54.0306085Z + source .ci/pytorch/common.sh 2024-04-03T11:41:54.0311902Z +++ dirname .ci/pytorch/common.sh 2024-04-03T11:41:54.0333114Z ++ source .ci/pytorch/common_utils.sh 2024-04-03T11:41:54.0334296Z +++ declare -f -t trap_add 2024-04-03T11:41:54.0358912Z ++ set -ex 2024-04-03T11:41:54.0360186Z ++ [[ linux-focal-rocm6.0-py3.8 == *rocm* ]] 2024-04-03T11:41:54.0361192Z ++ unset HIP_PLATFORM 
2024-04-03T11:41:54.0362001Z ++ export PYTORCH_TEST_WITH_ROCM=1 2024-04-03T11:41:54.0362920Z ++ PYTORCH_TEST_WITH_ROCM=1 2024-04-03T11:41:54.0363805Z ++ export HSAKMT_DEBUG_LEVEL=4 2024-04-03T11:41:54.0364652Z ++ HSAKMT_DEBUG_LEVEL=4 2024-04-03T11:41:54.0365475Z ++ export HSA_FORCE_FINE_GRAIN_PCIE=1 2024-04-03T11:41:54.0366432Z ++ HSA_FORCE_FINE_GRAIN_PCIE=1 2024-04-03T11:41:54.0367282Z ++ BUILD_TEST_LIBTORCH=0 2024-04-03T11:41:54.0368204Z + echo 'Environment variables' 2024-04-03T11:41:54.0369062Z Environment variables 2024-04-03T11:41:54.0369778Z + env 2024-04-03T11:41:54.0378879Z INSTALLED_DB=yes 2024-04-03T11:41:54.0380405Z GITHUB_WORKSPACE=/home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T11:41:54.0381771Z CONTINUE_THROUGH_ERROR=False 2024-04-03T11:41:54.0382817Z BUILD_ENVIRONMENT=linux-focal-rocm6.0-py3.8 2024-04-03T11:41:54.0383854Z HOSTNAME=48ab1b864422 2024-04-03T11:41:54.0385975Z GITHUB_PATH=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/add_path_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0387855Z GITHUB_ACTION=__self 2024-04-03T11:41:54.0388645Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2024-04-03T11:41:54.0389541Z GITHUB_RUN_NUMBER=71645 2024-04-03T11:41:54.0390293Z TEST_CONFIG=default 2024-04-03T11:41:54.0391054Z GITHUB_REPOSITORY_OWNER_ID=21003710 2024-04-03T11:41:54.0392068Z AWS_DEFAULT_REGION=us-east-1 2024-04-03T11:41:54.0393069Z GITHUB_TRIGGERING_ACTOR=pytorch-bot[bot] 2024-04-03T11:41:54.0394032Z GITHUB_REF_TYPE=tag 2024-04-03T11:41:54.0394868Z *** 2024-04-03T11:41:54.0395517Z GITHUB_REPOSITORY_ID=65600975 2024-04-03T11:41:54.0396385Z GITHUB_ACTIONS=true 2024-04-03T11:41:54.0397243Z SHA1=d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:41:54.0398430Z GITHUB_SHA=d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:41:54.0400155Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk.yml@refs/tags/ciflow/trunk/123154 2024-04-03T11:41:54.0401744Z VERBOSE_TEST_LOGS=False 2024-04-03T11:41:54.0402591Z 
GITHUB_REF=refs/tags/ciflow/trunk/123154 2024-04-03T11:41:54.0403563Z SHARD_NUMBER=1 2024-04-03T11:41:54.0404266Z GITHUB_REF_PROTECTED=false 2024-04-03T11:41:54.0405074Z HOME=/var/lib/jenkins 2024-04-03T11:41:54.0405921Z GITHUB_API_URL=https://api.github.com 2024-04-03T11:41:54.0406925Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2024-04-03T11:41:54.0407885Z LANG=C.UTF-8 2024-04-03T11:41:54.0408591Z PYTORCH_TEST_WITH_ROCM=1 2024-04-03T11:41:54.0409396Z NUM_TEST_SHARDS=1 2024-04-03T11:41:54.0411313Z GITHUB_STATE=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/save_state_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0413786Z JOB_NAME=linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu) 2024-04-03T11:41:54.0415164Z MAGMA_HOME=/opt/rocm/magma 2024-04-03T11:41:54.0417120Z GITHUB_ENV=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_env_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0418953Z HSAKMT_DEBUG_LEVEL=4 2024-04-03T11:41:54.0420687Z GITHUB_EVENT_PATH=/home/pytorchci/actions-runner/_work/_temp/_github_workflow/event.json 2024-04-03T11:41:54.0422219Z GITHUB_EVENT_NAME=push 2024-04-03T11:41:54.0422991Z GITHUB_RUN_ID=8536952077 2024-04-03T11:41:54.0425080Z GITHUB_STEP_SUMMARY=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/step_summary_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0427161Z GITHUB_ACTOR=pytorch-bot[bot] 2024-04-03T11:41:54.0428032Z PR_NUMBER= 2024-04-03T11:41:54.0428976Z GITHUB_RUN_ATTEMPT=1 2024-04-03T11:41:54.0429759Z VALGRIND=ON 2024-04-03T11:41:54.0430442Z ANACONDA_PYTHON_VERSION=3.8 2024-04-03T11:41:54.0431450Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2024-04-03T11:41:54.0432533Z TERM=xterm 2024-04-03T11:41:54.0433191Z INSTALLED_VISION=yes 2024-04-03T11:41:54.0433934Z BRANCH= 2024-04-03T11:41:54.0435603Z TESTS_TO_INCLUDE=test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor 
2024-04-03T11:41:54.0438667Z GITHUB_ACTION_PATH=/home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2024-04-03T11:41:54.0440388Z GITHUB_SERVER_URL=https://github.com 2024-04-03T11:41:54.0441361Z PYTORCH_ROCM_ARCH=gfx906;gfx90a 2024-04-03T11:41:54.0442245Z REENABLED_ISSUES= 2024-04-03T11:41:54.0442955Z SHLVL=1 2024-04-03T11:41:54.0443578Z MAX_JOBS=62 2024-04-03T11:41:54.0444235Z GITHUB_ACTOR_ID=54816060 2024-04-03T11:41:54.0445280Z GITHUB_WORKFLOW_SHA=d253bd7befc64137de913dbb8727b511f1052411 2024-04-03T11:41:54.0446505Z GITHUB_REF_NAME=ciflow/trunk/123154 2024-04-03T11:41:54.0447427Z ROCM_PATH=/opt/rocm 2024-04-03T11:41:54.0448684Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2024-04-03T11:41:54.0449990Z GITHUB_JOB=test 2024-04-03T11:41:54.0450684Z NO_TEST_TIMEOUT=False 2024-04-03T11:41:54.0451482Z GITHUB_REPOSITORY=pytorch/pytorch 2024-04-03T11:41:54.0452421Z LC_ALL=C.UTF-8 2024-04-03T11:41:54.0453122Z GITHUB_RETENTION_DAYS=90 2024-04-03T11:41:54.0453941Z GITHUB_ACTION_REPOSITORY= 2024-04-03T11:41:54.0456906Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.8/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-04-03T11:41:54.0460008Z GITHUB_BASE_REF= 2024-04-03T11:41:54.0460713Z CI=true 2024-04-03T11:41:54.0461356Z HSA_FORCE_FINE_GRAIN_PCIE=1 2024-04-03T11:41:54.0462226Z GITHUB_REPOSITORY_OWNER=pytorch 2024-04-03T11:41:54.0463098Z JOB_ID=23389745826 2024-04-03T11:41:54.0463839Z INSTALLED_PROTOBUF=yes 2024-04-03T11:41:54.0464626Z GITHUB_HEAD_REF= 2024-04-03T11:41:54.0465353Z GITHUB_ACTION_REF= 2024-04-03T11:41:54.0466343Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2024-04-03T11:41:54.0467403Z GITHUB_WORKFLOW=trunk 2024-04-03T11:41:54.0468193Z DEBIAN_FRONTEND=noninteractive 2024-04-03T11:41:54.0470250Z 
GITHUB_OUTPUT=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_output_2e66fa44-0515-4978-966e-1cc88ba7c255 2024-04-03T11:41:54.0472143Z NO_TD=False 2024-04-03T11:41:54.0472802Z OLDPWD=/var/lib/jenkins 2024-04-03T11:41:54.0473585Z _=/usr/bin/env 2024-04-03T11:41:54.0474362Z + echo 'Testing pytorch' 2024-04-03T11:41:54.0475151Z Testing pytorch 2024-04-03T11:41:54.0475887Z + export LANG=C.UTF-8 2024-04-03T11:41:54.0476651Z + LANG=C.UTF-8 2024-04-03T11:41:54.0477318Z + PR_NUMBER= 2024-04-03T11:41:54.0478021Z + [[ default == \d\e\f\a\u\l\t ]] 2024-04-03T11:41:54.0478975Z + export CUDA_VISIBLE_DEVICES=0 2024-04-03T11:41:54.0479850Z + CUDA_VISIBLE_DEVICES=0 2024-04-03T11:41:54.0480677Z + export HIP_VISIBLE_DEVICES=0 2024-04-03T11:41:54.0481568Z + HIP_VISIBLE_DEVICES=0 2024-04-03T11:41:54.0482408Z + [[ default == \d\i\s\t\r\i\b\u\t\e\d ]] 2024-04-03T11:41:54.0483373Z + [[ default == \s\l\o\w ]] 2024-04-03T11:41:54.0484500Z + [[ linux-focal-rocm6.0-py3.8 == *slow-gradcheck* ]] 2024-04-03T11:41:54.0485783Z + [[ linux-focal-rocm6.0-py3.8 == *cuda* ]] 2024-04-03T11:41:54.0486952Z + [[ linux-focal-rocm6.0-py3.8 == *rocm* ]] 2024-04-03T11:41:54.0488071Z + export PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2024-04-03T11:41:54.0489151Z + PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2024-04-03T11:41:54.0490116Z + [[ default == *crossref* ]] 2024-04-03T11:41:54.0491140Z + [[ linux-focal-rocm6.0-py3.8 == *rocm* ]] 2024-04-03T11:41:54.0492131Z + export VALGRIND=OFF 2024-04-03T11:41:54.0492870Z + VALGRIND=OFF 2024-04-03T11:41:54.0493537Z + rocminfo 2024-04-03T11:41:54.0549536Z ROCk module is loaded 2024-04-03T11:41:54.1170576Z ===================== 2024-04-03T11:41:54.1171792Z HSA System Attributes 2024-04-03T11:41:54.1172625Z ===================== 2024-04-03T11:41:54.1173439Z Runtime Version: 1.1 2024-04-03T11:41:54.1174350Z System Timestamp Freq.: 1000.000000MHz 2024-04-03T11:41:54.1175884Z Sig. 
Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2024-04-03T11:41:54.1177579Z Machine Model: LARGE 2024-04-03T11:41:54.1178982Z System Endianness: LITTLE 2024-04-03T11:41:54.1180501Z Mwaitx: DISABLED 2024-04-03T11:41:54.1219988Z DMAbuf Support: YES 2024-04-03T11:41:54.1220615Z 2024-04-03T11:41:54.1220903Z ========== 2024-04-03T11:41:54.1221760Z HSA Agents 2024-04-03T11:41:54.1222611Z ========== 2024-04-03T11:41:54.1223430Z ******* 2024-04-03T11:41:54.1224206Z Agent 1 2024-04-03T11:41:54.1224992Z ******* 2024-04-03T11:41:54.1226279Z Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.1228043Z Uuid: CPU-XX 2024-04-03T11:41:54.1229575Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.1231028Z Vendor Name: CPU 2024-04-03T11:41:54.1232373Z Feature: None specified 2024-04-03T11:41:54.1233714Z Profile: FULL_PROFILE 2024-04-03T11:41:54.1235116Z Float Round Mode: NEAR 2024-04-03T11:41:54.1236524Z Max Queue Number: 0(0x0) 2024-04-03T11:41:54.1237938Z Queue Min Size: 0(0x0) 2024-04-03T11:41:54.1239311Z Queue Max Size: 0(0x0) 2024-04-03T11:41:54.1240718Z Queue Type: MULTI 2024-04-03T11:41:54.1242006Z Node: 0 2024-04-03T11:41:54.1243302Z Device Type: CPU 2024-04-03T11:41:54.1244454Z Cache Info: 2024-04-03T11:41:54.1245435Z L1: 32768(0x8000) KB 2024-04-03T11:41:54.1246707Z Chip ID: 0(0x0) 2024-04-03T11:41:54.1248022Z ASIC Revision: 0(0x0) 2024-04-03T11:41:54.1249378Z Cacheline Size: 64(0x40) 2024-04-03T11:41:54.1250754Z Max Clock Freq. (MHz): 2200 2024-04-03T11:41:54.1252064Z BDFID: 0 2024-04-03T11:41:54.1253369Z Internal Node ID: 0 2024-04-03T11:41:54.1254734Z Compute Unit: 16 2024-04-03T11:41:54.1256067Z SIMDs per CU: 0 2024-04-03T11:41:54.1257445Z Shader Engines: 0 2024-04-03T11:41:54.1258847Z Shader Arrs. per Eng.: 0 2024-04-03T11:41:54.1260575Z WatchPts on Addr. 
Ranges:1 2024-04-03T11:41:54.1261791Z Features: None 2024-04-03T11:41:54.1262675Z Pool Info: 2024-04-03T11:41:54.1263523Z Pool 1 2024-04-03T11:41:54.1264605Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-04-03T11:41:54.1265972Z Size: 131917252(0x7dce5c4) KB 2024-04-03T11:41:54.1267316Z Allocatable: TRUE 2024-04-03T11:41:54.1268682Z Alloc Granule: 4KB 2024-04-03T11:41:54.1270067Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1271482Z Accessible by all: TRUE 2024-04-03T11:41:54.1273020Z Pool 2 2024-04-03T11:41:54.1274145Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-04-03T11:41:54.1275202Z Size: 131917252(0x7dce5c4) KB 2024-04-03T11:41:54.1275910Z Allocatable: TRUE 2024-04-03T11:41:54.1276627Z Alloc Granule: 4KB 2024-04-03T11:41:54.1277353Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1278225Z Accessible by all: TRUE 2024-04-03T11:41:54.1278846Z Pool 3 2024-04-03T11:41:54.1279407Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:41:54.1280112Z Size: 131917252(0x7dce5c4) KB 2024-04-03T11:41:54.1280820Z Allocatable: TRUE 2024-04-03T11:41:54.1281547Z Alloc Granule: 4KB 2024-04-03T11:41:54.1282271Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1283002Z Accessible by all: TRUE 2024-04-03T11:41:54.1283622Z ISA Info: 2024-04-03T11:41:54.1284053Z ******* 2024-04-03T11:41:54.1284475Z Agent 2 2024-04-03T11:41:54.1284883Z ******* 2024-04-03T11:41:54.1285492Z Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.1286232Z Uuid: CPU-XX 2024-04-03T11:41:54.1287010Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.1287746Z Vendor Name: CPU 2024-04-03T11:41:54.1288453Z Feature: None specified 2024-04-03T11:41:54.1289151Z Profile: FULL_PROFILE 2024-04-03T11:41:54.1289879Z Float Round Mode: NEAR 2024-04-03T11:41:54.1290612Z Max Queue Number: 0(0x0) 2024-04-03T11:41:54.1291354Z Queue Min Size: 0(0x0) 2024-04-03T11:41:54.1292076Z Queue Max Size: 0(0x0) 2024-04-03T11:41:54.1292782Z Queue Type: MULTI 2024-04-03T11:41:54.1293459Z Node: 1 2024-04-03T11:41:54.1294134Z Device 
Type: CPU 2024-04-03T11:41:54.1294735Z Cache Info: 2024-04-03T11:41:54.1295254Z L1: 32768(0x8000) KB 2024-04-03T11:41:54.1295926Z Chip ID: 0(0x0) 2024-04-03T11:41:54.1296623Z ASIC Revision: 0(0x0) 2024-04-03T11:41:54.1297338Z Cacheline Size: 64(0x40) 2024-04-03T11:41:54.1298062Z Max Clock Freq. (MHz): 2200 2024-04-03T11:41:54.1298750Z BDFID: 0 2024-04-03T11:41:54.1299436Z Internal Node ID: 1 2024-04-03T11:41:54.1300204Z Compute Unit: 16 2024-04-03T11:41:54.1300963Z SIMDs per CU: 0 2024-04-03T11:41:54.1301806Z Shader Engines: 0 2024-04-03T11:41:54.1302685Z Shader Arrs. per Eng.: 0 2024-04-03T11:41:54.1303588Z WatchPts on Addr. Ranges:1 2024-04-03T11:41:54.1304348Z Features: None 2024-04-03T11:41:54.1304895Z Pool Info: 2024-04-03T11:41:54.1305429Z Pool 1 2024-04-03T11:41:54.1306247Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-04-03T11:41:54.1307106Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:41:54.1307934Z Allocatable: TRUE 2024-04-03T11:41:54.1308796Z Alloc Granule: 4KB 2024-04-03T11:41:54.1309665Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1310548Z Accessible by all: TRUE 2024-04-03T11:41:54.1311397Z Pool 2 2024-04-03T11:41:54.1312073Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-04-03T11:41:54.1312926Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:41:54.1313757Z Allocatable: TRUE 2024-04-03T11:41:54.1314576Z Alloc Granule: 4KB 2024-04-03T11:41:54.1315305Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1316044Z Accessible by all: TRUE 2024-04-03T11:41:54.1316660Z Pool 3 2024-04-03T11:41:54.1317224Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:41:54.1317916Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:41:54.1318605Z Allocatable: TRUE 2024-04-03T11:41:54.1319326Z Alloc Granule: 4KB 2024-04-03T11:41:54.1320042Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1320770Z Accessible by all: TRUE 2024-04-03T11:41:54.1321384Z ISA Info: 2024-04-03T11:41:54.1321811Z ******* 2024-04-03T11:41:54.1322223Z Agent 3 2024-04-03T11:41:54.1322634Z ******* 2024-04-03T11:41:54.1323235Z Name: 
AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.1323970Z Uuid: CPU-XX 2024-04-03T11:41:54.1324741Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.1325474Z Vendor Name: CPU 2024-04-03T11:41:54.1326168Z Feature: None specified 2024-04-03T11:41:54.1326861Z Profile: FULL_PROFILE 2024-04-03T11:41:54.1327576Z Float Round Mode: NEAR 2024-04-03T11:41:54.1328312Z Max Queue Number: 0(0x0) 2024-04-03T11:41:54.1329024Z Queue Min Size: 0(0x0) 2024-04-03T11:41:54.1329729Z Queue Max Size: 0(0x0) 2024-04-03T11:41:54.1330426Z Queue Type: MULTI 2024-04-03T11:41:54.1331098Z Node: 2 2024-04-03T11:41:54.1331765Z Device Type: CPU 2024-04-03T11:41:54.1332361Z Cache Info: 2024-04-03T11:41:54.1332871Z L1: 32768(0x8000) KB 2024-04-03T11:41:54.1333532Z Chip ID: 0(0x0) 2024-04-03T11:41:54.1334219Z ASIC Revision: 0(0x0) 2024-04-03T11:41:54.1334931Z Cacheline Size: 64(0x40) 2024-04-03T11:41:54.1335650Z Max Clock Freq. (MHz): 2200 2024-04-03T11:41:54.1336345Z BDFID: 0 2024-04-03T11:41:54.1337040Z Internal Node ID: 2 2024-04-03T11:41:54.1337745Z Compute Unit: 16 2024-04-03T11:41:54.1338440Z SIMDs per CU: 0 2024-04-03T11:41:54.1339263Z Shader Engines: 0 2024-04-03T11:41:54.1340053Z Shader Arrs. per Eng.: 0 2024-04-03T11:41:54.1340836Z WatchPts on Addr. 
Ranges:1 2024-04-03T11:41:54.1341573Z Features: None 2024-04-03T11:41:54.1342123Z Pool Info: 2024-04-03T11:41:54.1342640Z Pool 1 2024-04-03T11:41:54.1343310Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-04-03T11:41:54.1344273Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:41:54.1345102Z Allocatable: TRUE 2024-04-03T11:41:54.1345967Z Alloc Granule: 4KB 2024-04-03T11:41:54.1346840Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1347727Z Accessible by all: TRUE 2024-04-03T11:41:54.1348477Z Pool 2 2024-04-03T11:41:54.1349162Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-04-03T11:41:54.1350010Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:41:54.1350842Z Allocatable: TRUE 2024-04-03T11:41:54.1351712Z Alloc Granule: 4KB 2024-04-03T11:41:54.1352586Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1353477Z Accessible by all: TRUE 2024-04-03T11:41:54.1354215Z Pool 3 2024-04-03T11:41:54.1354861Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:41:54.1355559Z Size: 132112152(0x7dfdf18) KB 2024-04-03T11:41:54.1356254Z Allocatable: TRUE 2024-04-03T11:41:54.1356973Z Alloc Granule: 4KB 2024-04-03T11:41:54.1357700Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1358432Z Accessible by all: TRUE 2024-04-03T11:41:54.1359044Z ISA Info: 2024-04-03T11:41:54.1359493Z ******* 2024-04-03T11:41:54.1359905Z Agent 4 2024-04-03T11:41:54.1360313Z ******* 2024-04-03T11:41:54.1360918Z Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.1361650Z Uuid: CPU-XX 2024-04-03T11:41:54.1362420Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.1363142Z Vendor Name: CPU 2024-04-03T11:41:54.1363838Z Feature: None specified 2024-04-03T11:41:54.1364530Z Profile: FULL_PROFILE 2024-04-03T11:41:54.1365242Z Float Round Mode: NEAR 2024-04-03T11:41:54.1365967Z Max Queue Number: 0(0x0) 2024-04-03T11:41:54.1366676Z Queue Min Size: 0(0x0) 2024-04-03T11:41:54.1367391Z Queue Max Size: 0(0x0) 2024-04-03T11:41:54.1368097Z Queue Type: MULTI 2024-04-03T11:41:54.1368764Z Node: 3 2024-04-03T11:41:54.1369444Z Device 
Type: CPU 2024-04-03T11:41:54.1370036Z Cache Info: 2024-04-03T11:41:54.1370548Z L1: 32768(0x8000) KB 2024-04-03T11:41:54.1371216Z Chip ID: 0(0x0) 2024-04-03T11:41:54.1371910Z ASIC Revision: 0(0x0) 2024-04-03T11:41:54.1372730Z Cacheline Size: 64(0x40) 2024-04-03T11:41:54.1373458Z Max Clock Freq. (MHz): 2200 2024-04-03T11:41:54.1374143Z BDFID: 0 2024-04-03T11:41:54.1374836Z Internal Node ID: 3 2024-04-03T11:41:54.1375539Z Compute Unit: 16 2024-04-03T11:41:54.1376232Z SIMDs per CU: 0 2024-04-03T11:41:54.1377018Z Shader Engines: 0 2024-04-03T11:41:54.1377738Z Shader Arrs. per Eng.: 0 2024-04-03T11:41:54.1378482Z WatchPts on Addr. Ranges:1 2024-04-03T11:41:54.1379102Z Features: None 2024-04-03T11:41:54.1379567Z Pool Info: 2024-04-03T11:41:54.1380150Z Pool 1 2024-04-03T11:41:54.1380740Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-04-03T11:41:54.1381580Z Size: 132106664(0x7dfc9a8) KB 2024-04-03T11:41:54.1382407Z Allocatable: TRUE 2024-04-03T11:41:54.1383270Z Alloc Granule: 4KB 2024-04-03T11:41:54.1384142Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1385029Z Accessible by all: TRUE 2024-04-03T11:41:54.1385777Z Pool 2 2024-04-03T11:41:54.1386453Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-04-03T11:41:54.1387305Z Size: 132106664(0x7dfc9a8) KB 2024-04-03T11:41:54.1388144Z Allocatable: TRUE 2024-04-03T11:41:54.1389003Z Alloc Granule: 4KB 2024-04-03T11:41:54.1389879Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1390775Z Accessible by all: TRUE 2024-04-03T11:41:54.1391515Z Pool 3 2024-04-03T11:41:54.1392191Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:41:54.1393033Z Size: 132106664(0x7dfc9a8) KB 2024-04-03T11:41:54.1393861Z Allocatable: TRUE 2024-04-03T11:41:54.1394706Z Alloc Granule: 4KB 2024-04-03T11:41:54.1395430Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1396161Z Accessible by all: TRUE 2024-04-03T11:41:54.1396770Z ISA Info: 2024-04-03T11:41:54.1397199Z ******* 2024-04-03T11:41:54.1397610Z Agent 5 2024-04-03T11:41:54.1398020Z ******* 2024-04-03T11:41:54.1398513Z Name: 
gfx906 2024-04-03T11:41:54.1399245Z Uuid: GPU-c406418172e62692 2024-04-03T11:41:54.1399949Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:41:54.1400664Z Vendor Name: AMD 2024-04-03T11:41:54.1401352Z Feature: KERNEL_DISPATCH 2024-04-03T11:41:54.1402039Z Profile: BASE_PROFILE 2024-04-03T11:41:54.1402754Z Float Round Mode: NEAR 2024-04-03T11:41:54.1403475Z Max Queue Number: 128(0x80) 2024-04-03T11:41:54.1404187Z Queue Min Size: 64(0x40) 2024-04-03T11:41:54.1404893Z Queue Max Size: 131072(0x20000) 2024-04-03T11:41:54.1405588Z Queue Type: MULTI 2024-04-03T11:41:54.1406382Z Node: 4 2024-04-03T11:41:54.1407069Z Device Type: GPU 2024-04-03T11:41:54.1407668Z Cache Info: 2024-04-03T11:41:54.1408179Z L1: 16(0x10) KB 2024-04-03T11:41:54.1408817Z L2: 8192(0x2000) KB 2024-04-03T11:41:54.1409815Z Chip ID: 26273(0x66a1) 2024-04-03T11:41:54.1417341Z ASIC Revision: 1(0x1) 2024-04-03T11:41:54.1418050Z Cacheline Size: 64(0x40) 2024-04-03T11:41:54.1418770Z Max Clock Freq. (MHz): 1800 2024-04-03T11:41:54.1419459Z BDFID: 8960 2024-04-03T11:41:54.1420240Z Internal Node ID: 4 2024-04-03T11:41:54.1420988Z Compute Unit: 64 2024-04-03T11:41:54.1421823Z SIMDs per CU: 4 2024-04-03T11:41:54.1422657Z Shader Engines: 4 2024-04-03T11:41:54.1423525Z Shader Arrs. per Eng.: 1 2024-04-03T11:41:54.1424428Z WatchPts on Addr. 
Ranges:4 2024-04-03T11:41:54.1425318Z Coherent Host Access: FALSE 2024-04-03T11:41:54.1426108Z Features: KERNEL_DISPATCH 2024-04-03T11:41:54.1426907Z Fast F16 Operation: TRUE 2024-04-03T11:41:54.1427785Z Wavefront Size: 64(0x40) 2024-04-03T11:41:54.1428664Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:41:54.1429437Z Workgroup Max Size per Dimension: 2024-04-03T11:41:54.1430117Z x 1024(0x400) 2024-04-03T11:41:54.1430858Z y 1024(0x400) 2024-04-03T11:41:54.1431594Z z 1024(0x400) 2024-04-03T11:41:54.1432404Z Max Waves Per CU: 40(0x28) 2024-04-03T11:41:54.1433409Z Max Work-item Per CU: 2560(0xa00) 2024-04-03T11:41:54.1434289Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:41:54.1434985Z Grid Max Size per Dimension: 2024-04-03T11:41:54.1435539Z x 4294967295(0xffffffff) 2024-04-03T11:41:54.1436166Z y 4294967295(0xffffffff) 2024-04-03T11:41:54.1436797Z z 4294967295(0xffffffff) 2024-04-03T11:41:54.1437485Z Max fbarriers/Workgrp: 32 2024-04-03T11:41:54.1453613Z Packet Processor uCode:: 469 2024-04-03T11:41:54.1454389Z SDMA engine uCode:: 145 2024-04-03T11:41:54.1455128Z IOMMU Support:: None 2024-04-03T11:41:54.1455740Z Pool Info: 2024-04-03T11:41:54.1456183Z Pool 1 2024-04-03T11:41:54.1456753Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:41:54.1457465Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1458164Z Allocatable: TRUE 2024-04-03T11:41:54.1458891Z Alloc Granule: 4KB 2024-04-03T11:41:54.1459805Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1460559Z Accessible by all: FALSE 2024-04-03T11:41:54.1461231Z Pool 2 2024-04-03T11:41:54.1461898Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2024-04-03T11:41:54.1462735Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1463765Z Allocatable: TRUE 2024-04-03T11:41:54.1464636Z Alloc Granule: 4KB 2024-04-03T11:41:54.1465507Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1466393Z Accessible by all: FALSE 2024-04-03T11:41:54.1467136Z Pool 3 2024-04-03T11:41:54.1467801Z Segment: GLOBAL; FLAGS: FINE GRAINED 
2024-04-03T11:41:54.1468813Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1469640Z Allocatable: TRUE 2024-04-03T11:41:54.1470522Z Alloc Granule: 4KB 2024-04-03T11:41:54.1471394Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1472282Z Accessible by all: FALSE 2024-04-03T11:41:54.1473032Z Pool 4 2024-04-03T11:41:54.1473686Z Segment: GROUP 2024-04-03T11:41:54.1474508Z Size: 64(0x40) KB 2024-04-03T11:41:54.1475215Z Allocatable: FALSE 2024-04-03T11:41:54.1475926Z Alloc Granule: 0KB 2024-04-03T11:41:54.1476644Z Alloc Alignment: 0KB 2024-04-03T11:41:54.1477382Z Accessible by all: FALSE 2024-04-03T11:41:54.1477998Z ISA Info: 2024-04-03T11:41:54.1478434Z ISA 1 2024-04-03T11:41:54.1479150Z Name: amdgcn-amd-amdhsa--gfx906:sramecc+:xnack- 2024-04-03T11:41:54.1479909Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-04-03T11:41:54.1480651Z Profiles: HSA_PROFILE_BASE 2024-04-03T11:41:54.1481394Z Default Rounding Mode: NEAR 2024-04-03T11:41:54.1482148Z Default Rounding Mode: NEAR 2024-04-03T11:41:54.1482870Z Fast f16: TRUE 2024-04-03T11:41:54.1483583Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:41:54.1484238Z Workgroup Max Size per Dimension: 2024-04-03T11:41:54.1484836Z x 1024(0x400) 2024-04-03T11:41:54.1485461Z y 1024(0x400) 2024-04-03T11:41:54.1486088Z z 1024(0x400) 2024-04-03T11:41:54.1486775Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:41:54.1487413Z Grid Max Size per Dimension: 2024-04-03T11:41:54.1487976Z x 4294967295(0xffffffff) 2024-04-03T11:41:54.1488629Z y 4294967295(0xffffffff) 2024-04-03T11:41:54.1489281Z z 4294967295(0xffffffff) 2024-04-03T11:41:54.1489965Z FBarrier Max Size: 32 2024-04-03T11:41:54.1490564Z ******* 2024-04-03T11:41:54.1490974Z Agent 6 2024-04-03T11:41:54.1491386Z ******* 2024-04-03T11:41:54.1491889Z Name: gfx906 2024-04-03T11:41:54.1492621Z Uuid: GPU-3014910172e62698 2024-04-03T11:41:54.1493328Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:41:54.1494046Z Vendor Name: AMD 2024-04-03T11:41:54.1494742Z Feature: KERNEL_DISPATCH 
2024-04-03T11:41:54.1495429Z Profile: BASE_PROFILE 2024-04-03T11:41:54.1496420Z Float Round Mode: NEAR 2024-04-03T11:41:54.1497154Z Max Queue Number: 128(0x80) 2024-04-03T11:41:54.1497868Z Queue Min Size: 64(0x40) 2024-04-03T11:41:54.1498576Z Queue Max Size: 131072(0x20000) 2024-04-03T11:41:54.1499272Z Queue Type: MULTI 2024-04-03T11:41:54.1499993Z Node: 5 2024-04-03T11:41:54.1500767Z Device Type: GPU 2024-04-03T11:41:54.1501459Z Cache Info: 2024-04-03T11:41:54.1502062Z L1: 16(0x10) KB 2024-04-03T11:41:54.1502820Z L2: 8192(0x2000) KB 2024-04-03T11:41:54.1503612Z Chip ID: 26273(0x66a1) 2024-04-03T11:41:54.1504450Z ASIC Revision: 1(0x1) 2024-04-03T11:41:54.1505311Z Cacheline Size: 64(0x40) 2024-04-03T11:41:54.1506163Z Max Clock Freq. (MHz): 1800 2024-04-03T11:41:54.1506982Z BDFID: 9728 2024-04-03T11:41:54.1507794Z Internal Node ID: 5 2024-04-03T11:41:54.1508637Z Compute Unit: 64 2024-04-03T11:41:54.1509478Z SIMDs per CU: 4 2024-04-03T11:41:54.1510325Z Shader Engines: 4 2024-04-03T11:41:54.1511197Z Shader Arrs. per Eng.: 1 2024-04-03T11:41:54.1512091Z WatchPts on Addr. 
Ranges:4 2024-04-03T11:41:54.1512981Z Coherent Host Access: FALSE 2024-04-03T11:41:54.1513763Z Features: KERNEL_DISPATCH 2024-04-03T11:41:54.1514542Z Fast F16 Operation: TRUE 2024-04-03T11:41:54.1515268Z Wavefront Size: 64(0x40) 2024-04-03T11:41:54.1515986Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:41:54.1516623Z Workgroup Max Size per Dimension: 2024-04-03T11:41:54.1517185Z x 1024(0x400) 2024-04-03T11:41:54.1517798Z y 1024(0x400) 2024-04-03T11:41:54.1518424Z z 1024(0x400) 2024-04-03T11:41:54.1519102Z Max Waves Per CU: 40(0x28) 2024-04-03T11:41:54.1519932Z Max Work-item Per CU: 2560(0xa00) 2024-04-03T11:41:54.1520672Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:41:54.1521313Z Grid Max Size per Dimension: 2024-04-03T11:41:54.1521851Z x 4294967295(0xffffffff) 2024-04-03T11:41:54.1522481Z y 4294967295(0xffffffff) 2024-04-03T11:41:54.1523110Z z 4294967295(0xffffffff) 2024-04-03T11:41:54.1523819Z Max fbarriers/Workgrp: 32 2024-04-03T11:41:54.1524573Z Packet Processor uCode:: 469 2024-04-03T11:41:54.1525313Z SDMA engine uCode:: 145 2024-04-03T11:41:54.1526044Z IOMMU Support:: None 2024-04-03T11:41:54.1526651Z Pool Info: 2024-04-03T11:41:54.1527093Z Pool 1 2024-04-03T11:41:54.1527660Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:41:54.1528359Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1529047Z Allocatable: TRUE 2024-04-03T11:41:54.1529873Z Alloc Granule: 4KB 2024-04-03T11:41:54.1530606Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1531346Z Accessible by all: FALSE 2024-04-03T11:41:54.1531968Z Pool 2 2024-04-03T11:41:54.1532542Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2024-04-03T11:41:54.1533240Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1534033Z Allocatable: TRUE 2024-04-03T11:41:54.1534760Z Alloc Granule: 4KB 2024-04-03T11:41:54.1535495Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1536236Z Accessible by all: FALSE 2024-04-03T11:41:54.1536855Z Pool 3 2024-04-03T11:41:54.1537430Z Segment: GLOBAL; FLAGS: FINE GRAINED 
2024-04-03T11:41:54.1538126Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1538817Z Allocatable: TRUE 2024-04-03T11:41:54.1539527Z Alloc Granule: 4KB 2024-04-03T11:41:54.1540312Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1541047Z Accessible by all: FALSE 2024-04-03T11:41:54.1541671Z Pool 4 2024-04-03T11:41:54.1542209Z Segment: GROUP 2024-04-03T11:41:54.1542882Z Size: 64(0x40) KB 2024-04-03T11:41:54.1543569Z Allocatable: FALSE 2024-04-03T11:41:54.1544278Z Alloc Granule: 0KB 2024-04-03T11:41:54.1545004Z Alloc Alignment: 0KB 2024-04-03T11:41:54.1545737Z Accessible by all: FALSE 2024-04-03T11:41:54.1546352Z ISA Info: 2024-04-03T11:41:54.1546797Z ISA 1 2024-04-03T11:41:54.1547451Z Name: amdgcn-amd-amdhsa--gfx906:sramecc+:xnack- 2024-04-03T11:41:54.1548219Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-04-03T11:41:54.1548954Z Profiles: HSA_PROFILE_BASE 2024-04-03T11:41:54.1549709Z Default Rounding Mode: NEAR 2024-04-03T11:41:54.1550463Z Default Rounding Mode: NEAR 2024-04-03T11:41:54.1551189Z Fast f16: TRUE 2024-04-03T11:41:54.1551909Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:41:54.1552561Z Workgroup Max Size per Dimension: 2024-04-03T11:41:54.1553155Z x 1024(0x400) 2024-04-03T11:41:54.1553786Z y 1024(0x400) 2024-04-03T11:41:54.1554416Z z 1024(0x400) 2024-04-03T11:41:54.1555106Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:41:54.1555749Z Grid Max Size per Dimension: 2024-04-03T11:41:54.1556313Z x 4294967295(0xffffffff) 2024-04-03T11:41:54.1556969Z y 4294967295(0xffffffff) 2024-04-03T11:41:54.1557615Z z 4294967295(0xffffffff) 2024-04-03T11:41:54.1558317Z FBarrier Max Size: 32 2024-04-03T11:41:54.1558923Z ******* 2024-04-03T11:41:54.1559343Z Agent 7 2024-04-03T11:41:54.1559758Z ******* 2024-04-03T11:41:54.1560376Z Name: gfx906 2024-04-03T11:41:54.1561124Z Uuid: GPU-72a690c172edb148 2024-04-03T11:41:54.1561833Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:41:54.1562565Z Vendor Name: AMD 2024-04-03T11:41:54.1563256Z Feature: KERNEL_DISPATCH 
2024-04-03T11:41:54.1563958Z Profile: BASE_PROFILE 2024-04-03T11:41:54.1564774Z Float Round Mode: NEAR 2024-04-03T11:41:54.1565501Z Max Queue Number: 128(0x80) 2024-04-03T11:41:54.1566214Z Queue Min Size: 64(0x40) 2024-04-03T11:41:54.1566926Z Queue Max Size: 131072(0x20000) 2024-04-03T11:41:54.1567623Z Queue Type: MULTI 2024-04-03T11:41:54.1568295Z Node: 6 2024-04-03T11:41:54.1568961Z Device Type: GPU 2024-04-03T11:41:54.1569556Z Cache Info: 2024-04-03T11:41:54.1570071Z L1: 16(0x10) KB 2024-04-03T11:41:54.1570713Z L2: 8192(0x2000) KB 2024-04-03T11:41:54.1571371Z Chip ID: 26273(0x66a1) 2024-04-03T11:41:54.1572070Z ASIC Revision: 1(0x1) 2024-04-03T11:41:54.1572784Z Cacheline Size: 64(0x40) 2024-04-03T11:41:54.1573500Z Max Clock Freq. (MHz): 1800 2024-04-03T11:41:54.1574181Z BDFID: 25344 2024-04-03T11:41:54.1574865Z Internal Node ID: 6 2024-04-03T11:41:54.1575572Z Compute Unit: 64 2024-04-03T11:41:54.1576277Z SIMDs per CU: 4 2024-04-03T11:41:54.1576980Z Shader Engines: 4 2024-04-03T11:41:54.1577706Z Shader Arrs. per Eng.: 1 2024-04-03T11:41:54.1578462Z WatchPts on Addr. 
Ranges:4 2024-04-03T11:41:54.1579212Z Coherent Host Access: FALSE 2024-04-03T11:41:54.1579995Z Features: KERNEL_DISPATCH 2024-04-03T11:41:54.1580657Z Fast F16 Operation: TRUE 2024-04-03T11:41:54.1581378Z Wavefront Size: 64(0x40) 2024-04-03T11:41:54.1582092Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:41:54.1582735Z Workgroup Max Size per Dimension: 2024-04-03T11:41:54.1583298Z x 1024(0x400) 2024-04-03T11:41:54.1583916Z y 1024(0x400) 2024-04-03T11:41:54.1584528Z z 1024(0x400) 2024-04-03T11:41:54.1585206Z Max Waves Per CU: 40(0x28) 2024-04-03T11:41:54.1586028Z Max Work-item Per CU: 2560(0xa00) 2024-04-03T11:41:54.1586763Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:41:54.1587393Z Grid Max Size per Dimension: 2024-04-03T11:41:54.1587923Z x 4294967295(0xffffffff) 2024-04-03T11:41:54.1588561Z y 4294967295(0xffffffff) 2024-04-03T11:41:54.1589191Z z 4294967295(0xffffffff) 2024-04-03T11:41:54.1589877Z Max fbarriers/Workgrp: 32 2024-04-03T11:41:54.1590627Z Packet Processor uCode:: 469 2024-04-03T11:41:54.1591361Z SDMA engine uCode:: 145 2024-04-03T11:41:54.1592217Z IOMMU Support:: None 2024-04-03T11:41:54.1592842Z Pool Info: 2024-04-03T11:41:54.1593281Z Pool 1 2024-04-03T11:41:54.1593847Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:41:54.1594546Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1595237Z Allocatable: TRUE 2024-04-03T11:41:54.1596050Z Alloc Granule: 4KB 2024-04-03T11:41:54.1596769Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1597506Z Accessible by all: FALSE 2024-04-03T11:41:54.1598125Z Pool 2 2024-04-03T11:41:54.1598694Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2024-04-03T11:41:54.1599400Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1600099Z Allocatable: TRUE 2024-04-03T11:41:54.1600817Z Alloc Granule: 4KB 2024-04-03T11:41:54.1601542Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1602272Z Accessible by all: FALSE 2024-04-03T11:41:54.1602891Z Pool 3 2024-04-03T11:41:54.1603450Z Segment: GLOBAL; FLAGS: FINE GRAINED 
2024-04-03T11:41:54.1604152Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1604836Z Allocatable: TRUE 2024-04-03T11:41:54.1605553Z Alloc Granule: 4KB 2024-04-03T11:41:54.1606281Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1607031Z Accessible by all: FALSE 2024-04-03T11:41:54.1607657Z Pool 4 2024-04-03T11:41:54.1608199Z Segment: GROUP 2024-04-03T11:41:54.1608871Z Size: 64(0x40) KB 2024-04-03T11:41:54.1609557Z Allocatable: FALSE 2024-04-03T11:41:54.1610277Z Alloc Granule: 0KB 2024-04-03T11:41:54.1610991Z Alloc Alignment: 0KB 2024-04-03T11:41:54.1611738Z Accessible by all: FALSE 2024-04-03T11:41:54.1612353Z ISA Info: 2024-04-03T11:41:54.1612789Z ISA 1 2024-04-03T11:41:54.1613452Z Name: amdgcn-amd-amdhsa--gfx906:sramecc+:xnack- 2024-04-03T11:41:54.1614207Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-04-03T11:41:54.1614946Z Profiles: HSA_PROFILE_BASE 2024-04-03T11:41:54.1615683Z Default Rounding Mode: NEAR 2024-04-03T11:41:54.1616443Z Default Rounding Mode: NEAR 2024-04-03T11:41:54.1617164Z Fast f16: TRUE 2024-04-03T11:41:54.1617880Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:41:54.1618531Z Workgroup Max Size per Dimension: 2024-04-03T11:41:54.1619124Z x 1024(0x400) 2024-04-03T11:41:54.1619793Z y 1024(0x400) 2024-04-03T11:41:54.1620422Z z 1024(0x400) 2024-04-03T11:41:54.1621120Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:41:54.1621753Z Grid Max Size per Dimension: 2024-04-03T11:41:54.1622312Z x 4294967295(0xffffffff) 2024-04-03T11:41:54.1623064Z y 4294967295(0xffffffff) 2024-04-03T11:41:54.1623714Z z 4294967295(0xffffffff) 2024-04-03T11:41:54.1624409Z FBarrier Max Size: 32 2024-04-03T11:41:54.1625008Z ******* 2024-04-03T11:41:54.1625418Z Agent 8 2024-04-03T11:41:54.1625829Z ******* 2024-04-03T11:41:54.1626326Z Name: gfx906 2024-04-03T11:41:54.1627194Z Uuid: GPU-301890c172e62698 2024-04-03T11:41:54.1627907Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:41:54.1628628Z Vendor Name: AMD 2024-04-03T11:41:54.1629317Z Feature: KERNEL_DISPATCH 
2024-04-03T11:41:54.1630012Z Profile: BASE_PROFILE 2024-04-03T11:41:54.1630726Z Float Round Mode: NEAR 2024-04-03T11:41:54.1631450Z Max Queue Number: 128(0x80) 2024-04-03T11:41:54.1632166Z Queue Min Size: 64(0x40) 2024-04-03T11:41:54.1632875Z Queue Max Size: 131072(0x20000) 2024-04-03T11:41:54.1633575Z Queue Type: MULTI 2024-04-03T11:41:54.1634241Z Node: 7 2024-04-03T11:41:54.1634926Z Device Type: GPU 2024-04-03T11:41:54.1635524Z Cache Info: 2024-04-03T11:41:54.1636040Z L1: 16(0x10) KB 2024-04-03T11:41:54.1636675Z L2: 8192(0x2000) KB 2024-04-03T11:41:54.1637331Z Chip ID: 26273(0x66a1) 2024-04-03T11:41:54.1638028Z ASIC Revision: 1(0x1) 2024-04-03T11:41:54.1638730Z Cacheline Size: 64(0x40) 2024-04-03T11:41:54.1639444Z Max Clock Freq. (MHz): 1800 2024-04-03T11:41:54.1640122Z BDFID: 26112 2024-04-03T11:41:54.1640800Z Internal Node ID: 7 2024-04-03T11:41:54.1641509Z Compute Unit: 64 2024-04-03T11:41:54.1642220Z SIMDs per CU: 4 2024-04-03T11:41:54.1642919Z Shader Engines: 4 2024-04-03T11:41:54.1643643Z Shader Arrs. per Eng.: 1 2024-04-03T11:41:54.1644394Z WatchPts on Addr. 
Ranges:4 2024-04-03T11:41:54.1645139Z Coherent Host Access: FALSE 2024-04-03T11:41:54.1645793Z Features: KERNEL_DISPATCH 2024-04-03T11:41:54.1646447Z Fast F16 Operation: TRUE 2024-04-03T11:41:54.1647169Z Wavefront Size: 64(0x40) 2024-04-03T11:41:54.1647887Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:41:54.1648523Z Workgroup Max Size per Dimension: 2024-04-03T11:41:54.1649089Z x 1024(0x400) 2024-04-03T11:41:54.1649703Z y 1024(0x400) 2024-04-03T11:41:54.1650331Z z 1024(0x400) 2024-04-03T11:41:54.1651007Z Max Waves Per CU: 40(0x28) 2024-04-03T11:41:54.1651827Z Max Work-item Per CU: 2560(0xa00) 2024-04-03T11:41:54.1652559Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:41:54.1653180Z Grid Max Size per Dimension: 2024-04-03T11:41:54.1653825Z x 4294967295(0xffffffff) 2024-04-03T11:41:54.1654463Z y 4294967295(0xffffffff) 2024-04-03T11:41:54.1655096Z z 4294967295(0xffffffff) 2024-04-03T11:41:54.1655783Z Max fbarriers/Workgrp: 32 2024-04-03T11:41:54.1656534Z Packet Processor uCode:: 469 2024-04-03T11:41:54.1657275Z SDMA engine uCode:: 145 2024-04-03T11:41:54.1658083Z IOMMU Support:: None 2024-04-03T11:41:54.1658682Z Pool Info: 2024-04-03T11:41:54.1659124Z Pool 1 2024-04-03T11:41:54.1659809Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-04-03T11:41:54.1660517Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1661211Z Allocatable: TRUE 2024-04-03T11:41:54.1661940Z Alloc Granule: 4KB 2024-04-03T11:41:54.1662667Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1663399Z Accessible by all: FALSE 2024-04-03T11:41:54.1664019Z Pool 2 2024-04-03T11:41:54.1664600Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2024-04-03T11:41:54.1665303Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1666001Z Allocatable: TRUE 2024-04-03T11:41:54.1666716Z Alloc Granule: 4KB 2024-04-03T11:41:54.1667441Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1668176Z Accessible by all: FALSE 2024-04-03T11:41:54.1668793Z Pool 3 2024-04-03T11:41:54.1669352Z Segment: GLOBAL; FLAGS: FINE GRAINED 
2024-04-03T11:41:54.1670058Z Size: 33538048(0x1ffc000) KB 2024-04-03T11:41:54.1670744Z Allocatable: TRUE 2024-04-03T11:41:54.1671460Z Alloc Granule: 4KB 2024-04-03T11:41:54.1672186Z Alloc Alignment: 4KB 2024-04-03T11:41:54.1672926Z Accessible by all: FALSE 2024-04-03T11:41:54.1673552Z Pool 4 2024-04-03T11:41:54.1674095Z Segment: GROUP 2024-04-03T11:41:54.1674768Z Size: 64(0x40) KB 2024-04-03T11:41:54.1675457Z Allocatable: FALSE 2024-04-03T11:41:54.1676166Z Alloc Granule: 0KB 2024-04-03T11:41:54.1676891Z Alloc Alignment: 0KB 2024-04-03T11:41:54.1677626Z Accessible by all: FALSE 2024-04-03T11:41:54.1678239Z ISA Info: 2024-04-03T11:41:54.1678687Z ISA 1 2024-04-03T11:41:54.1679344Z Name: amdgcn-amd-amdhsa--gfx906:sramecc+:xnack- 2024-04-03T11:41:54.1680105Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-04-03T11:41:54.1680840Z Profiles: HSA_PROFILE_BASE 2024-04-03T11:41:54.1681585Z Default Rounding Mode: NEAR 2024-04-03T11:41:54.1682346Z Default Rounding Mode: NEAR 2024-04-03T11:41:54.1683069Z Fast f16: TRUE 2024-04-03T11:41:54.1683783Z Workgroup Max Size: 1024(0x400) 2024-04-03T11:41:54.1684431Z Workgroup Max Size per Dimension: 2024-04-03T11:41:54.1685144Z x 1024(0x400) 2024-04-03T11:41:54.1685788Z y 1024(0x400) 2024-04-03T11:41:54.1686412Z z 1024(0x400) 2024-04-03T11:41:54.1687098Z Grid Max Size: 4294967295(0xffffffff) 2024-04-03T11:41:54.1687735Z Grid Max Size per Dimension: 2024-04-03T11:41:54.1688297Z x 4294967295(0xffffffff) 2024-04-03T11:41:54.1689056Z y 4294967295(0xffffffff) 2024-04-03T11:41:54.1689699Z z 4294967295(0xffffffff) 2024-04-03T11:41:54.1690385Z FBarrier Max Size: 32 2024-04-03T11:41:54.1690992Z *** Done *** 2024-04-03T11:41:54.1691408Z + rocminfo 2024-04-03T11:41:54.1691843Z + grep -E 'Name:.*\sgfx|Marketing' 2024-04-03T11:41:54.2046037Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.2046949Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.2047898Z Marketing Name: AMD EPYC 7601 32-Core Processor 
2024-04-03T11:41:54.2048885Z Marketing Name: AMD EPYC 7601 32-Core Processor 2024-04-03T11:41:54.2049728Z Name: gfx906 2024-04-03T11:41:54.2050567Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:41:54.2051414Z Name: gfx906 2024-04-03T11:41:54.2052268Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:41:54.2053106Z Name: gfx906 2024-04-03T11:41:54.2053970Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:41:54.2054803Z Name: gfx906 2024-04-03T11:41:54.2055639Z Marketing Name: AMD Radeon Graphics 2024-04-03T11:41:54.2282841Z + [[ linux-focal-rocm6.0-py3.8 == *xpu* ]] 2024-04-03T11:41:54.2283515Z + [[ linux-focal-rocm6.0-py3.8 != *-bazel-* ]] 2024-04-03T11:41:54.2284126Z + pip_install --user ninja==1.10.2 2024-04-03T11:41:54.2284771Z + pip install --progress-bar off --user ninja==1.10.2 2024-04-03T11:41:54.7564153Z Collecting ninja==1.10.2 2024-04-03T11:41:54.7990894Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl.metadata (5.0 kB) 2024-04-03T11:41:54.8225322Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl (108 kB) 2024-04-03T11:41:55.7201843Z Installing collected packages: ninja 2024-04-03T11:41:55.7304153Z  WARNING: The script ninja is installed in '/var/lib/jenkins/.local/bin' which is not on PATH. 2024-04-03T11:41:55.7308327Z Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location. 
2024-04-03T11:41:55.7347668Z Successfully installed ninja-1.10.2 2024-04-03T11:41:55.8591349Z + export PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.8/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-04-03T11:41:55.8600356Z + PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.8/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-04-03T11:41:55.8605443Z + install_tlparse 2024-04-03T11:41:55.8606812Z + pip_install --user tlparse==0.3.7 2024-04-03T11:41:55.8608591Z + pip install --progress-bar off --user tlparse==0.3.7 2024-04-03T11:41:56.3646716Z Collecting tlparse==0.3.7 2024-04-03T11:41:56.3968950Z Downloading tlparse-0.3.7-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (346 bytes) 2024-04-03T11:41:56.4111448Z Downloading tlparse-0.3.7-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.2 MB) 2024-04-03T11:41:57.3598374Z Installing collected packages: tlparse 2024-04-03T11:41:57.4014973Z Successfully installed tlparse-0.3.7 2024-04-03T11:41:57.5164853Z ++ python -m site --user-base 2024-04-03T11:41:57.5465962Z + PATH=/var/lib/jenkins/.local/bin:/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.8/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-04-03T11:41:57.5473039Z + [[ linux-focal-rocm6.0-py3.8 == *asan* ]] 2024-04-03T11:41:57.5474771Z + [[ linux-focal-rocm6.0-py3.8 == *-debug* ]] 2024-04-03T11:41:57.5476531Z + [[ linux-focal-rocm6.0-py3.8 != *-bazel-* ]] 2024-04-03T11:41:57.5479127Z + echo 'We are not in debug mode: linux-focal-rocm6.0-py3.8. 
Expect the assertion to pass' 2024-04-03T11:41:57.5482333Z We are not in debug mode: linux-focal-rocm6.0-py3.8. Expect the assertion to pass 2024-04-03T11:41:57.5484406Z + cd test 2024-04-03T11:41:57.5486126Z + python -c 'import torch; torch._C._crash_if_debug_asserts_fail(424242)' 2024-04-03T11:42:00.2664203Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]] 2024-04-03T11:42:00.2665847Z + [[ default == \n\o\g\p\u\_\A\V\X\5\1\2 ]] 2024-04-03T11:42:00.2671828Z + DYNAMO_BENCHMARK_FLAGS=() 2024-04-03T11:42:00.2673160Z + [[ default == *dynamo_eager* ]] 2024-04-03T11:42:00.2674443Z + [[ default == *aot_eager* ]] 2024-04-03T11:42:00.2675669Z + [[ default == *aot_inductor* ]] 2024-04-03T11:42:00.2676963Z + [[ default == *inductor* ]] 2024-04-03T11:42:00.2678154Z + [[ default == *dynamic* ]] 2024-04-03T11:42:00.2679373Z + [[ default == *cpu_inductor* ]] 2024-04-03T11:42:00.2681177Z + DYNAMO_BENCHMARK_FLAGS+=(--device cuda) 2024-04-03T11:42:00.2711390Z + [[ linux-focal-rocm6.0-py3.8 == *tbb* ]] 2024-04-03T11:42:00.2757953Z + [[ linux-focal-rocm6.0-py3.8 == *libtorch* ]] 2024-04-03T11:42:00.2759280Z + [[ linux-focal-rocm6.0-py3.8 == *-bazel-* ]] 2024-04-03T11:42:00.2760307Z + cd test 2024-04-03T11:42:00.2761354Z + python -c 'import torch; print(torch.__config__.show())' 2024-04-03T11:42:02.0256626Z PyTorch built with: 2024-04-03T11:42:02.0257498Z - GCC 9.4 2024-04-03T11:42:02.0257967Z - C++ Version: 201703 2024-04-03T11:42:02.0259198Z - Intel(R) oneAPI Math Kernel Library Version 2021.4-Product Build 20210904 for Intel(R) 64 architecture applications 2024-04-03T11:42:02.0260895Z - Intel(R) MKL-DNN v3.3.6 (Git Hash 86e6af5974177e513fd3fee58425e1063e7f1361) 2024-04-03T11:42:02.0261892Z - OpenMP 201511 (a.k.a. 
OpenMP 4.5) 2024-04-03T11:42:02.0262648Z - LAPACK is enabled (usually provided by MKL) 2024-04-03T11:42:02.0263379Z - NNPACK is enabled 2024-04-03T11:42:02.0263931Z - CPU capability usage: AVX2 2024-04-03T11:42:02.0264439Z - HIP Runtime 6.0.32830 2024-04-03T11:42:02.0264895Z - MIOpen 3.0.0 2024-04-03T11:42:02.0265293Z - Magma 2.7.2 2024-04-03T11:42:02.0274092Z - Build settings: BLAS_INFO=mkl, BUILD_TYPE=Release, CXX_COMPILER=/opt/cache/bin/c++, CXX_FLAGS= -D_GLIBCXX_USE_CXX11_ABI=1 -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOCUPTI -DUSE_FBGEMM -DUSE_QNNPACK -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE -O2 -fPIC -Wall -Wextra -Werror=return-type -Werror=non-virtual-dtor -Werror=bool-operation -Wnarrowing -Wno-missing-field-initializers -Wno-type-limits -Wno-array-bounds -Wno-unknown-pragmas -Wno-unused-parameter -Wno-unused-function -Wno-unused-result -Wno-strict-overflow -Wno-strict-aliasing -Wno-stringop-overflow -Wsuggest-override -Wno-psabi -Wno-error=pedantic -Wno-error=old-style-cast -Wno-missing-braces -fdiagnostics-color=always -faligned-new -Wno-unused-but-set-variable -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, LAPACK_INFO=mkl, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, PERF_WITH_AVX512=1, TORCH_VERSION=2.4.0, USE_CUDA=OFF, USE_CUDNN=OFF, USE_CUSPARSELT=OFF, USE_EXCEPTION_PTR=1, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_GLOO=ON, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, USE_NCCL=ON, USE_NNPACK=ON, USE_OPENMP=ON, USE_ROCM=ON, USE_ROCM_KERNEL_ASSERT=OFF, 2024-04-03T11:42:02.0281869Z 2024-04-03T11:42:02.9867239Z + cd test 2024-04-03T11:42:02.9869225Z + python -c 'import torch; print(torch.__config__.parallel_info())' 2024-04-03T11:42:04.7557448Z ATen/Parallel: 2024-04-03T11:42:04.7558235Z at::get_num_threads() : 32 2024-04-03T11:42:04.7558954Z at::get_num_interop_threads() : 64 2024-04-03T11:42:04.7559695Z OpenMP 201511 (a.k.a. 
OpenMP 4.5) 2024-04-03T11:42:04.7560710Z omp_get_max_threads() : 32 2024-04-03T11:42:04.7562244Z Intel(R) oneAPI Math Kernel Library Version 2021.4-Product Build 20210904 for Intel(R) 64 architecture applications 2024-04-03T11:42:04.7563425Z mkl_get_max_threads() : 32 2024-04-03T11:42:04.7564346Z Intel(R) MKL-DNN v3.3.6 (Git Hash 86e6af5974177e513fd3fee58425e1063e7f1361) 2024-04-03T11:42:04.7565301Z std::thread::hardware_concurrency() : 64 2024-04-03T11:42:04.7565939Z Environment variables: 2024-04-03T11:42:04.7566461Z OMP_NUM_THREADS : [not set] 2024-04-03T11:42:04.7567033Z MKL_NUM_THREADS : [not set] 2024-04-03T11:42:04.7567594Z ATen parallel backend: OpenMP 2024-04-03T11:42:04.7567966Z 2024-04-03T11:42:05.6921920Z + [[ default == *backward* ]] 2024-04-03T11:42:05.6923651Z + [[ default == *xla* ]] 2024-04-03T11:42:05.6924829Z + [[ default == *executorch* ]] 2024-04-03T11:42:05.6926140Z + [[ default == \j\i\t\_\l\e\g\a\c\y ]] 2024-04-03T11:42:05.6927925Z + [[ linux-focal-rocm6.0-py3.8 == *libtorch* ]] 2024-04-03T11:42:05.6929430Z + [[ default == distributed ]] 2024-04-03T11:42:05.6930687Z + [[ default == deploy ]] 2024-04-03T11:42:05.6931917Z + [[ default == *inductor_distributed* ]] 2024-04-03T11:42:05.6933314Z + [[ default == *huggingface* ]] 2024-04-03T11:42:05.6934549Z + [[ default == *timm* ]] 2024-04-03T11:42:05.6935697Z + [[ default == *torchbench* ]] 2024-04-03T11:42:05.6937207Z + [[ default == *inductor_cpp_wrapper_abi_compatible* ]] 2024-04-03T11:42:05.6938806Z + [[ default == *inductor* ]] 2024-04-03T11:42:05.6940171Z + [[ default == *dynamo* ]] 2024-04-03T11:42:05.6941408Z + [[ default == *dynamo* ]] 2024-04-03T11:42:05.6942526Z + [[ 1 == 1 ]] 2024-04-03T11:42:05.6943566Z + [[ 1 -gt 1 ]] 2024-04-03T11:42:05.6944501Z + [[ 1 == 2 ]] 2024-04-03T11:42:05.6945518Z + [[ 1 -gt 2 ]] 2024-04-03T11:42:05.6946800Z + [[ linux-focal-rocm6.0-py3.8 == *vulkan* ]] 2024-04-03T11:42:05.6948533Z + [[ linux-focal-rocm6.0-py3.8 == *-bazel-* ]] 2024-04-03T11:42:05.6950660Z 
+ [[ linux-focal-rocm6.0-py3.8 == *-mobile-lightweight-dispatch* ]] 2024-04-03T11:42:05.6952501Z + [[ default = docs_test ]] 2024-04-03T11:42:05.6953929Z + [[ linux-focal-rocm6.0-py3.8 == *rocm* ]] 2024-04-03T11:42:05.6956947Z + [[ -n test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor ]] 2024-04-03T11:42:05.6959578Z + install_torchvision 2024-04-03T11:42:05.6960643Z + local orig_preload 2024-04-03T11:42:05.6961671Z + local commit 2024-04-03T11:42:05.6962671Z ++ get_pinned_commit vision 2024-04-03T11:42:05.6963928Z ++ cat .github/ci_commit_pins/vision.txt 2024-04-03T11:42:05.6965505Z + commit=2c4665ffbb64f03f5d18016d3398af4ac4da5f03 2024-04-03T11:42:05.6966946Z + orig_preload= 2024-04-03T11:42:05.6968013Z + '[' -n '' ']' 2024-04-03T11:42:05.6970686Z + pip_install --no-use-pep517 --user git+https://github.com/pytorch/vision.git@2c4665ffbb64f03f5d18016d3398af4ac4da5f03 2024-04-03T11:42:05.6975402Z + pip install --progress-bar off --no-use-pep517 --user git+https://github.com/pytorch/vision.git@2c4665ffbb64f03f5d18016d3398af4ac4da5f03 2024-04-03T11:42:06.1604109Z Collecting git+https://github.com/pytorch/vision.git@2c4665ffbb64f03f5d18016d3398af4ac4da5f03 2024-04-03T11:42:06.1608562Z Cloning https://github.com/pytorch/vision.git (to revision 2c4665ffbb64f03f5d18016d3398af4ac4da5f03) to /tmp/pip-req-build-kow2r29o 2024-04-03T11:42:06.1702959Z Running command git clone --filter=blob:none --quiet https://github.com/pytorch/vision.git /tmp/pip-req-build-kow2r29o 2024-04-03T11:42:09.5644197Z Running command git rev-parse -q --verify 'sha^2c4665ffbb64f03f5d18016d3398af4ac4da5f03' 2024-04-03T11:42:09.5701679Z Running command git fetch -q https://github.com/pytorch/vision.git 2c4665ffbb64f03f5d18016d3398af4ac4da5f03 2024-04-03T11:42:11.4550669Z Running command git checkout -q 2c4665ffbb64f03f5d18016d3398af4ac4da5f03 2024-04-03T11:42:12.3232604Z Resolved https://github.com/pytorch/vision.git to commit 
2c4665ffbb64f03f5d18016d3398af4ac4da5f03 2024-04-03T11:42:17.8606453Z Preparing metadata (setup.py) ... done 2024-04-03T11:42:17.8735906Z Requirement already satisfied: numpy in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torchvision==0.19.0a0+2c4665f) (1.21.2) 2024-04-03T11:42:17.8741593Z Requirement already satisfied: torch in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torchvision==0.19.0a0+2c4665f) (2.4.0a0+gitd253bd7) 2024-04-03T11:42:17.8747067Z Requirement already satisfied: pillow!=8.3.*,>=5.3.0 in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torchvision==0.19.0a0+2c4665f) (10.2.0) 2024-04-03T11:42:17.9019127Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch->torchvision==0.19.0a0+2c4665f) (3.13.1) 2024-04-03T11:42:17.9024972Z Requirement already satisfied: typing-extensions>=4.8.0 in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch->torchvision==0.19.0a0+2c4665f) (4.10.0) 2024-04-03T11:42:17.9030466Z Requirement already satisfied: sympy in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch->torchvision==0.19.0a0+2c4665f) (1.12) 2024-04-03T11:42:17.9035682Z Requirement already satisfied: networkx in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch->torchvision==0.19.0a0+2c4665f) (2.8.8) 2024-04-03T11:42:17.9040891Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch->torchvision==0.19.0a0+2c4665f) (3.1.3) 2024-04-03T11:42:17.9046130Z Requirement already satisfied: fsspec in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from torch->torchvision==0.19.0a0+2c4665f) (2024.2.0) 2024-04-03T11:42:17.9598952Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.8/lib/python3.8/site-packages (from jinja2->torch->torchvision==0.19.0a0+2c4665f) (2.1.5) 2024-04-03T11:42:17.9797625Z Requirement already satisfied: mpmath>=0.19 in
/opt/conda/envs/py_3.8/lib/python3.8/site-packages (from sympy->torch->torchvision==0.19.0a0+2c4665f) (1.3.0) 2024-04-03T11:42:17.9889053Z Building wheels for collected packages: torchvision 2024-04-03T11:43:22.8018667Z Building wheel for torchvision (setup.py) ... done 2024-04-03T11:43:22.8073164Z Created wheel for torchvision: filename=torchvision-0.19.0a0+2c4665f-cp38-cp38-linux_x86_64.whl size=1453866 sha256=6a5dcb88bebe3edb1073fde66acc0a665fcc4a4b197227470eafafc45428756f 2024-04-03T11:43:22.8080427Z Stored in directory: /var/lib/jenkins/.cache/pip/wheels/18/2c/86/874968eb88a9b731571b4749697300d3acf254c3cf8b4a52f1 2024-04-03T11:43:22.8146921Z Successfully built torchvision 2024-04-03T11:43:23.6068490Z Installing collected packages: torchvision 2024-04-03T11:43:24.0615855Z Successfully installed torchvision-0.19.0a0+2c4665f 2024-04-03T11:43:24.2014245Z + '[' -n '' ']' 2024-04-03T11:43:24.2015273Z + test_python 2024-04-03T11:43:24.2019520Z + python test/run_test.py --exclude-jit-executor --exclude-distributed-tests --include test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor --verbose 2024-04-03T11:43:24.3002316Z test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API.
See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-04-03T11:43:24.3005425Z import pkg_resources 2024-04-03T11:43:27.4755157Z Downloading https://ossci-metrics.s3.amazonaws.com/slow-tests.json to /var/lib/jenkins/pytorch/test/.pytorch-slow-tests.json 2024-04-03T11:43:27.8370050Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/pytorch/test/.pytorch-disabled-tests.json 2024-04-03T11:43:28.2799417Z Ignoring disabled issues: [''] 2024-04-03T11:43:28.2926429Z Found test times from artifacts 2024-04-03T11:43:28.3321805Z Found test times from artifacts 2024-04-03T11:43:28.3329165Z Running all tests 2024-04-03T11:43:28.3334715Z Name: tests to run (est. time: 76.77min) 2024-04-03T11:43:28.3336120Z Serial tests (6): 2024-04-03T11:43:28.3337929Z inductor/test_torchinductor 1/2 2024-04-03T11:43:28.3339916Z inductor/test_torchinductor 2/2 2024-04-03T11:43:28.3341232Z test_autograd 1/1 2024-04-03T11:43:28.3342315Z test_cuda 1/1 2024-04-03T11:43:28.3343294Z test_torch 1/1 2024-04-03T11:43:28.3344336Z test_nn 1/1 2024-04-03T11:43:28.3345387Z Parallel tests (12): 2024-04-03T11:43:28.3346460Z test_ops 1/10 2024-04-03T11:43:28.3347454Z test_ops 2/10 2024-04-03T11:43:28.3348430Z test_ops 3/10 2024-04-03T11:43:28.3349493Z test_ops 4/10 2024-04-03T11:43:28.3350490Z test_ops 5/10 2024-04-03T11:43:28.3351462Z test_ops 6/10 2024-04-03T11:43:28.3352476Z test_ops 7/10 2024-04-03T11:43:28.3353459Z test_ops 8/10 2024-04-03T11:43:28.3354452Z test_ops 9/10 2024-04-03T11:43:28.3355423Z test_ops 10/10 2024-04-03T11:43:28.3356449Z test_unary_ufuncs 1/1 2024-04-03T11:43:28.3357613Z test_binary_ufuncs 1/1 2024-04-03T11:43:28.3358808Z Name: excluded (est. time: 0.0min) 2024-04-03T11:43:28.3360120Z Serial tests (0): 2024-04-03T11:43:28.3361164Z Parallel tests (0): 2024-04-03T11:43:28.3363188Z Starting test batch 'tests to run' 0.0 seconds after initiating testing 2024-04-03T11:43:28.3775079Z Running inductor/test_torchinductor 1/2 ... 
[2024-04-03 11:43:28.376658] 2024-04-03T11:43:28.3783340Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'inductor/test_torchinductor.py', '--shard-id=1', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 11:43:28.377582] 2024-04-03T11:52:08.0823872Z 2024-04-03T11:52:08.0831885Z inductor/test_torchinductor 1/2 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_1.2_d8c374349ca31492_.log 2024-04-03T11:52:08.1048064Z Running 310 items in this shard: test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_double, 
test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_int, test/inductor/test_torchinductor.py::GPUTests::test_AllenaiLongformerBase_repro_cuda, test/inductor/test_torchinductor.py::GPUTests::test_abs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool1d_argmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool_with_output_size_0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_max_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_const_float_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_inplace_permuted_cuda, test/inductor/test_torchinductor.py::GPUTests::test_addmm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_alexnet_prefix_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_angle_cuda, test/inductor/test_torchinductor.py::GPUTests::test_any_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_as_strided_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d7_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_batch_norm_2d_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_batch_norm_2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bitwise2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bmm2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_both_scalars_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_add_autotune_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_computed_offsets_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_default_kwargs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bucketize_int_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_neg_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_int_ndigits_pos_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_empty_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_single_empty_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_empty_1d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cauchy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_clamp_type_promotion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_clone_cuda, test/inductor/test_torchinductor.py::GPUTests::test_compar_cuda, test/inductor/test_torchinductor.py::GPUTests::test_complex_fallback_cuda, test/inductor/test_torchinductor.py::GPUTests::test_computed_buffer_inlining_cuda, test/inductor/test_torchinductor.py::GPUTests::test_concat_add_inplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_consecutive_split_cumprod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_const_int32_to_float_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_3d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_fill_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_nd_inplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv2d_backward_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv2d_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv3d_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_convolution1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_convolution3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cudnn_rnn_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumprod_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_pattern_matcher_issue_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_data_type_propogation_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dense_mask_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div2_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_div3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div7_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_by_zero_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_precision_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout_trivial_1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtype_mismatch_issue_cuda, test/inductor/test_torchinductor.py::GPUTests::test_elu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_embedding_bag_byte_unpack_cuda, test/inductor/test_torchinductor.py::GPUTests::test_embedding_bag_cuda, test/inductor/test_torchinductor.py::GPUTests::test_empty_strided_cuda, test/inductor/test_torchinductor.py::GPUTests::test_exp2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_exp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_expand_as_cuda, test/inductor/test_torchinductor.py::GPUTests::test_expand_cuda, test/inductor/test_torchinductor.py::GPUTests::test_expanded_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_basic_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_with_return_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fft_real_input_real_output_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fill1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_float16_to_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_float32_to_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_float_index_expression_type_promotion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_floordiv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fmod_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_fmod_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_full_like_cuda, test/inductor/test_torchinductor.py::GPUTests::test_full_truncation_cuda, test/inductor/test_torchinductor.py::GPUTests::test_functionalize_rng_wrappers_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fuse_large_params_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fusing_write_into_disjoint_read_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gather1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gather3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gelu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_glu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_grid_sampler_2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_hardtanh_cuda, test/inductor/test_torchinductor.py::GPUTests::test_horizonal_fusion1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_floordiv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_propagation_remainder_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_as_masked_fill_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_failed_reinplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_fallback1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_fallback2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_indirect_load_broadcast_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inductor_assert_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_inductor_layout_optimization_input_mutations_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_mixed_dtype_ops_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_resize_as_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_where_pointwise_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_int_input_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_invalid_operand_issue1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_isinf2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_block_sizes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_offset_pointwise_cuda, test/inductor/test_torchinductor.py::GPUTests::test_layer_norm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lerp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_like_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_like_rands2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_like_rands_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linear_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linspace3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_list_clearing_cuda, test/inductor/test_torchinductor.py::GPUTests::test_log1p_cuda, test/inductor/test_torchinductor.py::GPUTests::test_log2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_logsumexp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_long_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_masked_fill_cuda, test/inductor/test_torchinductor.py::GPUTests::test_masked_fill_promotion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d4_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d7_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mean_cuda, test/inductor/test_torchinductor.py::GPUTests::test_min_max_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_misaligned_address_issue1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_move_arange_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multi_device_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multi_threading_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_any_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_prime_size_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_sum_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mutations_loop_fusion_cuda, test/inductor/test_torchinductor.py::GPUTests::test_narrow_cuda, test/inductor/test_torchinductor.py::GPUTests::test_new_empty_cuda, test/inductor/test_torchinductor.py::GPUTests::test_new_empty_strided_cuda, test/inductor/test_torchinductor.py::GPUTests::test_new_ones_cuda, test/inductor/test_torchinductor.py::GPUTests::test_one_hot_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pad_cast_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pad_view_cuda, test/inductor/test_torchinductor.py::GPUTests::test_philox_rand_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pixel_shuffle_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_airy_ai_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_y0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_y1_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_t_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_erf_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_erfcx_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_erfinv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_exp2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_gammaln_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_hermite_polynomial_h_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_i1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_laguerre_polynomial_l_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_legendre_polynomial_p_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_log1p_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_log_ndtr_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_i0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_i1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_k1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_multigammaln_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_ndtri_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_psi_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_round_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_scaled_modified_bessel_k0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_v_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_w_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_spherical_bessel_j0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_zeta_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_prod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_rand_like_deterministic_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randint_kernel_count_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randn_with_dtype_and_device_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reflection_pad2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remainder_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_noop_clone_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_noop_copy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_cuda, test/inductor/test_torchinductor.py::GPUTests::test_roi_align_cuda, test/inductor/test_torchinductor.py::GPUTests::test_round_correctness_cuda, test/inductor/test_torchinductor.py::GPUTests::test_round_cuda, test/inductor/test_torchinductor.py::GPUTests::test_rsqrt_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scalar_input_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scaled_dot_product_efficient_attention_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_add2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_bf16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_reduce2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_reduce3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scheduler_vertical_fusion1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_unaligned_mask_cuda, test/inductor/test_torchinductor.py::GPUTests::test_setitem_with_int_parameter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sgn_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_shape_prop_torch_ones_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sign_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_simplify_loops_cuda, test/inductor/test_torchinductor.py::GPUTests::test_single_elem_indirect_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sizehint_issue1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_view_with_graph_break_cuda, test/inductor/test_torchinductor.py::GPUTests::test_softmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_softmax_one_kernel_persist_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sort_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumprod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_with_sizes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_with_sizes_with_unbacked_symints_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sqrt_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_squeeze1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_stack_cuda, test/inductor/test_torchinductor.py::GPUTests::test_strided_inputs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum_int_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum_keepdims_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tan_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tanh_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tensor1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tensor3_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_tensor_index_put_slice_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tmp_not_defined_issue3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_topk_cuda, test/inductor/test_torchinductor.py::GPUTests::test_transpose_add_cuda, test/inductor/test_torchinductor.py::GPUTests::test_transpose_cuda, test/inductor/test_torchinductor.py::GPUTests::test_uint_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unroll_small_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unsqueeze_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unsqueeze_inplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_bicubic2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_cat_conv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_nearest2d_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_var_mean_cuda, test/inductor/test_torchinductor.py::GPUTests::test_vdd_clamp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_vertical_fusion1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_as_real_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_where_broadcast_cuda, test/inductor/test_torchinductor.py::GPUTests::test_where_with_logical_op_cuda, test/inductor/test_torchinductor.py::GPUTests::test_zeros_cuda, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_bandwidth_profiler, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_computed_indirect_mask, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_constant_folding_deallocation, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_divisible_by_16_covers_numel_args, 
test/inductor/test_torchinductor.py::TritonCodeGenTests::test_evict_last_non_coalesced_loads, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_indirect_device_assert, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_inductor_detach_view, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_kernel_names_descriptive, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_numpy_autograd, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_dtype, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_split_op_with_sym, test/inductor/test_torchinductor.py::NanCheckerTest::test_nan_checker_fail 2024-04-03T11:52:08.1189289Z 2024-04-03T11:52:08.1189886Z Running inductor/test_torchinductor 2/2 ... [2024-04-03 11:52:08.084535] 2024-04-03T11:52:08.1192137Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'inductor/test_torchinductor.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2024-04-03 11:52:08.085669] 2024-04-03T11:59:15.9857539Z 2024-04-03T11:59:15.9862559Z inductor/test_torchinductor 2/2 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_2.2_81c7c8f305edca70_.log 2024-04-03T11:59:16.0082546Z Running 342 items in this shard: test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_dense_transposed, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast1, 
test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_broadcast2, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_strided_int, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_strided, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_transposed, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_max_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_complex_cuda, test/inductor/test_torchinductor.py::GPUTests::test_add_const_int_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adding_tensor_offsets_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aliased_buffer_reuse_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_arange3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_arange5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin_with_duplicates_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_argmin_with_nan_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_min_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_to_float_cuda, test/inductor/test_torchinductor.py::GPUTests::test_as_strided_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_baddbmm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bernoulli1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bernoulli2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bfloat16_to_int16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bitwise3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bitwise_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bmm1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bool_cuda, test/inductor/test_torchinductor.py::GPUTests::test_buffer_batch_norm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_buffer_copied_in_graph_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_buffer_copied_in_graph_with_different_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_buffer_use_after_remove_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_pos_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_zero_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_int_ndigits_zero_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_extern_kernel_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_inplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_negative_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_of_loops_and_extern_kernel_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_legacy_empty_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_upcasting_cuda, test/inductor/test_torchinductor.py::GPUTests::test_clamp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_consecutive_split_cumsum_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_1d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_bn_fuse_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_functional_bn_fuse_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_inference_heuristics_cuda, test/inductor/test_torchinductor.py::GPUTests::test_conv_with_as_strided_cuda, test/inductor/test_torchinductor.py::GPUTests::test_convolution2_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_convolution4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cos_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_no_mask_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_fixed_layout_channels_last_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_fixed_layout_sequential_cuda, test/inductor/test_torchinductor.py::GPUTests::test_diagonal_copy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dist_bf16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dist_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div9_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_prim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout_deterministic_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout_trivial_0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtype_sympy_expr_cuda, test/inductor/test_torchinductor.py::GPUTests::test_embedding_cuda, test/inductor/test_torchinductor.py::GPUTests::test_empty1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_empty2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_erfc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_erfinv_cuda, test/inductor/test_torchinductor.py::GPUTests::test_expm1_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_list_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fallback_mutable_op_no_mutated_tensors_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fft_real_input_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fill2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_flip_cuda, test/inductor/test_torchinductor.py::GPUTests::test_float_index_expression_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fmin_fmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_forced_buffer_realize_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fuse_tiled_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gather2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_gather_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_getitem_cuda, test/inductor/test_torchinductor.py::GPUTests::test_hardsigmoid_cuda, test/inductor/test_torchinductor.py::GPUTests::test_hardswish_cuda, test/inductor/test_torchinductor.py::GPUTests::test_horizonal_fusion2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_dynamic_shapes_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_deterministic_fallback_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_index_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_index_put_reinplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_select_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inf_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inner_fn_str_and_stride_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_activations_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inplace_add_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_insignificant_strides_cuda, test/inductor/test_torchinductor.py::GPUTests::test_isinf_cuda, test/inductor/test_torchinductor.py::GPUTests::test_issue102546_cuda, test/inductor/test_torchinductor.py::GPUTests::test_kernel_names_cuda, test/inductor/test_torchinductor.py::GPUTests::test_kwargs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_l1_loss_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_broadcast_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_pointwise_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_strided_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_tensor_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_leaky_relu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_lgamma_cuda, test/inductor/test_torchinductor.py::GPUTests::test_like_rands3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linear1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linear2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linear_mixed_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linspace1_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_linspace2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_log_fp64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_log_softmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_logcumsumexp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_logcumsumexp_zero_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_masked_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_min_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_min_max_reduction_nan_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mixed_mm2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mixed_mm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mm_mixed_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mm_views_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multi_gpu_device_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multi_gpu_recompile_on_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_var_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_var_lowp_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nan_to_num_cuda, test/inductor/test_torchinductor.py::GPUTests::test_neg_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_neg_max_uint8_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_nll_loss_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nll_loss_forward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_no_mega_fusion_during_lowering_cuda, test/inductor/test_torchinductor.py::GPUTests::test_no_op_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_output_strides_cuda, test/inductor/test_torchinductor.py::GPUTests::test_permute1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_permute2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_j0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_j1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_u_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_v_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_w_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_digamma_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_entr_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_erfc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_expit_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_expm1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_gammainc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_gammaincc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_hermite_polynomial_he_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_i0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_i0e_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_i1e_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_logit_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_modified_bessel_k0_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_pointwise_ndtr_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_polygamma_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_scaled_modified_bessel_k1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_t_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_u_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_sinc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_xlog1py_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_xlogy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow_int_cuda, test/inductor/test_torchinductor.py::GPUTests::test_profiler_mark_wrapper_call_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randint_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randint_int64_mod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randn_generator_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randn_like_empty_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reflection_pad2d_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_relu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_remove_no_ops_cuda, test/inductor/test_torchinductor.py::GPUTests::test_repeat_interleave_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_require_stride_expanded_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_reuse_buffers_with_aliasing_cuda, test/inductor/test_torchinductor.py::GPUTests::test_roll_cuda, test/inductor/test_torchinductor.py::GPUTests::test_rsqrt_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scalar_output_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scaled_dot_product_attention_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_add1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_add3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_reduce1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_unaligned_mask_freezing_cuda, test/inductor/test_torchinductor.py::GPUTests::test_select_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sgn_extremal_cuda, test/inductor/test_torchinductor.py::GPUTests::test_shape_padding_cuda, test/inductor/test_torchinductor.py::GPUTests::test_should_pad_bench_for_bmm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sigmoid_cuda, test/inductor/test_torchinductor.py::GPUTests::test_signbit_cuda, test/inductor/test_torchinductor.py::GPUTests::test_silu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sin_cuda, test/inductor/test_torchinductor.py::GPUTests::test_single_elem_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_mutation1_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_slice_mutation2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_mutation3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter_reinplace_cuda, test/inductor/test_torchinductor.py::GPUTests::test_softmax_one_kernel_loop_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumprod_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumsum_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumsum_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_with_sizes_failed_cuda, test/inductor/test_torchinductor.py::GPUTests::test_squeeze2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_squeeze_varargs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_std_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sum5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tensor2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tensor_index_slice_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tmp_not_defined_issue1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_tmp_not_defined_issue2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_to_device_constant_cuda, test/inductor/test_torchinductor.py::GPUTests::test_to_device_cuda, test/inductor/test_torchinductor.py::GPUTests::test_to_dtype_cuda, test/inductor/test_torchinductor.py::GPUTests::test_to_memory_format_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_transposed_propagates_cuda, test/inductor/test_torchinductor.py::GPUTests::test_triu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_uint4x2_mixed_mm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unbind_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unfold_zero_dimension_tensor_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_bilinear2d_a_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_bilinear2d_b_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_nearest1d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_nearest2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_nearest3d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_var_correction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_as_complex_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_detach_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_on_aliased_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_uint8_through_differing_bitwidths_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views7_cuda, test/inductor/test_torchinductor.py::GPUTests::test_xblock_divides_xnumel_cuda, test/inductor/test_torchinductor.py::GPUTests::test_zero_dim_reductions_cuda, test/inductor/test_torchinductor.py::GPUTests::test_zero_element_mutation_cuda, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_cant_optimize_compute, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_config_option_dont_assume_alignment, 
test/inductor/test_torchinductor.py::TritonCodeGenTests::test_evict_last_non_coalesced_loads_block_ptr, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_inductor_sequence_nr, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_not_materialize_pointwise_reduction, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_numpy_on_gpu, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_compute, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_assert, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_dtype_with_constraint, test/inductor/test_torchinductor.py::RNNTest::test_rnn_compile_safe, test/inductor/test_torchinductor.py::NanCheckerTest::test_nan_checker_pass
2024-04-03T11:59:16.0249321Z
2024-04-03T11:59:16.0249787Z Running test_autograd 1/1 ... [2024-04-03 11:59:15.987635]
2024-04-03T11:59:16.0251764Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_autograd.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 11:59:15.988681]
2024-04-03T11:59:58.8846390Z
2024-04-03T11:59:58.8853287Z test_autograd 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_autograd_1.1_64b93bfb7a9554cc_.log
2024-04-03T11:59:58.9208149Z Running 613 items in this shard: test/test_autograd.py::TestAutograd::test_access_saved_tensor_twice_without_recomputation_works, test/test_autograd.py::TestAutograd::test_accumulate_grad, test/test_autograd.py::TestAutograd::test_accumulate_grad_posthooks_can_observe_tensor_prehook, test/test_autograd.py::TestAutograd::test_accumulate_grad_tensor_reference, test/test_autograd.py::TestAutograd::test_accumulate_grad_with_zero_numel_grad, test/test_autograd.py::TestAutograd::test_anomaly_assign_parent_cleanup, test/test_autograd.py::TestAutograd::test_anomaly_detect_nan, test/test_autograd.py::TestAutograd::test_anomaly_grad_warnings, test/test_autograd.py::TestAutograd::test_anomaly_mode_no_check_nan, test/test_autograd.py::TestAutograd::test_attribute_deletion, test/test_autograd.py::TestAutograd::test_autograd_inplace_view_of_view, test/test_autograd.py::TestAutograd::test_autograd_inplace_views_creation_meta, test/test_autograd.py::TestAutograd::test_autograd_inplace_views_cross_dtype, test/test_autograd.py::TestAutograd::test_autograd_multiple_views_python, test/test_autograd.py::TestAutograd::test_autograd_node_isinstance, test/test_autograd.py::TestAutograd::test_autograd_print_tensor, test/test_autograd.py::TestAutograd::test_autograd_python_custom_function_inplace, test/test_autograd.py::TestAutograd::test_autograd_simple_views_python, test/test_autograd.py::TestAutograd::test_autograd_views_codegen, test/test_autograd.py::TestAutograd::test_backward, test/test_autograd.py::TestAutograd::test_backward_badcalls, test/test_autograd.py::TestAutograd::test_backward_copy, test/test_autograd.py::TestAutograd::test_backward_create_graph_warns, test/test_autograd.py::TestAutograd::test_backward_no_grad, 
test/test_autograd.py::TestAutograd::test_backward_to_node, test/test_autograd.py::TestAutograd::test_backward_twice_retained_graph_with_saved_values, test/test_autograd.py::TestAutograd::test_backward_twice_retained_graph_without_saved_values, test/test_autograd.py::TestAutograd::test_backward_twice_with_saved_values, test/test_autograd.py::TestAutograd::test_backward_twice_without_saved_values, test/test_autograd.py::TestAutograd::test_backward_with_inputs, test/test_autograd.py::TestAutograd::test_backward_with_nonleaf_inputs, test/test_autograd.py::TestAutograd::test_calculate_shape_util, test/test_autograd.py::TestAutograd::test_callback_adds_callback, test/test_autograd.py::TestAutograd::test_callback_propagates_errors_from_device_thread, test/test_autograd.py::TestAutograd::test_cant_create_saved_tensors, test/test_autograd.py::TestAutograd::test_checkpoint_detects_non_determinism, test/test_autograd.py::TestAutograd::test_checkpoint_sequential_warns_if_use_reentrant_not_passed_explcitly, test/test_autograd.py::TestAutograd::test_checkpoint_valid_reset_on_error, test/test_autograd.py::TestAutograd::test_checkpoint_warns_if_use_reentrant_not_passed_explcitly, test/test_autograd.py::TestAutograd::test_checkpointing, test/test_autograd.py::TestAutograd::test_checkpointing_non_reentrant_autocast_cpu, test/test_autograd.py::TestAutograd::test_checkpointing_non_reentrant_autocast_gpu, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_arbitrary_input_output, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_correct_grad, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_custom_function_works, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_dataparallel, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_detached_tensor_use_reentrant_False, 
test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_detached_tensor_use_reentrant_True, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_input_requires_grad_False, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_input_requires_grad_True, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_memory_savings, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_parameter_used_in_an_out, test/test_autograd.py::TestAutograd::test_checkpointing_without_reentrant_with_context_fn, test/test_autograd.py::TestAutograd::test_copy_slices_graph_task_updates, test/test_autograd.py::TestAutograd::test_create_graph_and_full_backward_hook_cycle, test/test_autograd.py::TestAutograd::test_current_graph_task_execution_order, test/test_autograd.py::TestAutograd::test_current_graph_task_id, test/test_autograd.py::TestAutograd::test_current_node, test/test_autograd.py::TestAutograd::test_custom_autograd_no_early_free, test/test_autograd.py::TestAutograd::test_custom_autograd_repeated_grad_grad, test/test_autograd.py::TestAutograd::test_custom_function_cycle, test/test_autograd.py::TestAutograd::test_custom_function_error, test/test_autograd.py::TestAutograd::test_custom_function_exception, test/test_autograd.py::TestAutograd::test_custom_function_forward_mode_forward_is_no_op, test/test_autograd.py::TestAutograd::test_custom_function_forward_mode_inplace_checks, test/test_autograd.py::TestAutograd::test_custom_function_forward_mode_non_differentiable, test/test_autograd.py::TestAutograd::test_custom_function_forward_mode_non_tensor_before_tensor_args, test/test_autograd.py::TestAutograd::test_custom_function_forward_mode_view_checks, test/test_autograd.py::TestAutograd::test_custom_function_forward_mode_wrong_formula, test/test_autograd.py::TestAutograd::test_custom_function_local_inplace, 
test/test_autograd.py::TestAutograd::test_custom_function_mark_dirty_not_differentiable, test/test_autograd.py::TestAutograd::test_custom_function_no_tensors, test/test_autograd.py::TestAutograd::test_custom_function_non_tensor_inputs_outputs, test/test_autograd.py::TestAutograd::test_custom_function_preserve_torch_function_when_return_as_is, test/test_autograd.py::TestAutograd::test_custom_function_return_view_in_nograd, test/test_autograd.py::TestAutograd::test_custom_function_save_for_forward, test/test_autograd.py::TestAutograd::test_custom_function_saved_tensors, test/test_autograd.py::TestAutograd::test_custom_function_setup_context_multi_input, test/test_autograd.py::TestAutograd::test_custom_function_setup_context_multi_output, test/test_autograd.py::TestAutograd::test_custom_function_setup_context_simple, test/test_autograd.py::TestAutograd::test_custom_function_vmap_defaults, test/test_autograd.py::TestAutograd::test_deep_reentrant, test/test_autograd.py::TestAutograd::test_default_saved_variable_hooks_double_backward, test/test_autograd.py::TestAutograd::test_dep_nograd, test/test_autograd.py::TestAutograd::test_dependent_backward, test/test_autograd.py::TestAutograd::test_detach, test/test_autograd.py::TestAutograd::test_detach_base, test/test_autograd.py::TestAutograd::test_detach_then_inplace_raises_in_autograd, test/test_autograd.py::TestAutograd::test_diagonal_expanded_v, test/test_autograd.py::TestAutograd::test_dir, test/test_autograd.py::TestAutograd::test_disabling_saved_tensor_hooks, test/test_autograd.py::TestAutograd::test_disabling_saved_tensor_hooks_nested, test/test_autograd.py::TestAutograd::test_dont_materialize_grads, test/test_autograd.py::TestAutograd::test_duplicate_backward_root, test/test_autograd.py::TestAutograd::test_enable_grad_decorator_no_paren, test/test_autograd.py::TestAutograd::test_first_grad_fn_access_in_no_grad_mode, test/test_autograd.py::TestAutograd::test_free_deep_graph, 
test/test_autograd.py::TestAutograd::test_free_deep_graph_complicated, test/test_autograd.py::TestAutograd::test_free_deep_graph_pyfunction, test/test_autograd.py::TestAutograd::test_full_backward_hook_double_backward, test/test_autograd.py::TestAutograd::test_function, test/test_autograd.py::TestAutograd::test_function_returns_input, test/test_autograd.py::TestAutograd::test_function_returns_undefined_tensor, test/test_autograd.py::TestAutograd::test_gc_in_destructor, test/test_autograd.py::TestAutograd::test_grad, test/test_autograd.py::TestAutograd::test_grad_badcalls, test/test_autograd.py::TestAutograd::test_grad_batched_grad, test/test_autograd.py::TestAutograd::test_grad_empty_inputs, test/test_autograd.py::TestAutograd::test_grad_fn_attr_bindings, test/test_autograd.py::TestAutograd::test_grad_fn_badcalls, test/test_autograd.py::TestAutograd::test_grad_fn_prehooks, test/test_autograd.py::TestAutograd::test_grad_fn_prehooks_multiple_outputs, test/test_autograd.py::TestAutograd::test_grad_fn_prehooks_remove_hooks, test/test_autograd.py::TestAutograd::test_grad_materialize_grads, test/test_autograd.py::TestAutograd::test_grad_mode_class_decoration, test/test_autograd.py::TestAutograd::test_grad_mode_restored_reentrant, test/test_autograd.py::TestAutograd::test_grad_nonleaf, test/test_autograd.py::TestAutograd::test_grad_nonleaf_many_outputs, test/test_autograd.py::TestAutograd::test_grad_nonleaf_register_hook, test/test_autograd.py::TestAutograd::test_grad_to_node, test/test_autograd.py::TestAutograd::test_grad_to_node_inplace, test/test_autograd.py::TestAutograd::test_grad_to_node_materialize, test/test_autograd.py::TestAutograd::test_grad_to_node_multi, test/test_autograd.py::TestAutograd::test_grad_to_node_set, test/test_autograd.py::TestAutograd::test_grad_unreachable, test/test_autograd.py::TestAutograd::test_grad_unreachable_discovery, test/test_autograd.py::TestAutograd::test_gradcheck_backward_mul_by_grad_output, 
test/test_autograd.py::TestAutograd::test_gradcheck_check_batched_grad, test/test_autograd.py::TestAutograd::test_gradcheck_check_forward_or_backward_only, test/test_autograd.py::TestAutograd::test_gradcheck_check_no_differentiable_outputs, test/test_autograd.py::TestAutograd::test_gradcheck_complex_non_complex_outputs, test/test_autograd.py::TestAutograd::test_gradcheck_custom_error, test/test_autograd.py::TestAutograd::test_gradcheck_default_device_placement_context, test/test_autograd.py::TestAutograd::test_gradcheck_dense_and_sparse_inputs, test/test_autograd.py::TestAutograd::test_gradcheck_forward_ad, test/test_autograd.py::TestAutograd::test_gradcheck_forward_ad_batched_grad, test/test_autograd.py::TestAutograd::test_gradcheck_forward_ad_respects_requires_grad, test/test_autograd.py::TestAutograd::test_gradcheck_forward_ad_runs_with_no_requires_grad, test/test_autograd.py::TestAutograd::test_gradcheck_get_analytical_jacobian, test/test_autograd.py::TestAutograd::test_gradcheck_get_numerical_jacobian, test/test_autograd.py::TestAutograd::test_gradcheck_input_layout0, test/test_autograd.py::TestAutograd::test_gradcheck_input_layout1, test/test_autograd.py::TestAutograd::test_gradcheck_input_layout2, test/test_autograd.py::TestAutograd::test_gradcheck_input_layout3, test/test_autograd.py::TestAutograd::test_gradcheck_input_layout4, test/test_autograd.py::TestAutograd::test_gradcheck_jacobian_mismatch, test/test_autograd.py::TestAutograd::test_gradcheck_multiple_mkldnn_inputs, test/test_autograd.py::TestAutograd::test_gradcheck_nondeterministic, test/test_autograd.py::TestAutograd::test_gradcheck_output_shape_or_dtype_depend_on_values, test/test_autograd.py::TestAutograd::test_gradcheck_single_input, test/test_autograd.py::TestAutograd::test_gradcheck_test_outputs, test/test_autograd.py::TestAutograd::test_gradcheck_undefined_grad, test/test_autograd.py::TestAutograd::test_gradcheck_validates_input_mkldnn, 
test/test_autograd.py::TestAutograd::test_gradcheck_validates_inputs, test/test_autograd.py::TestAutograd::test_graph_save_on_cpu, test/test_autograd.py::TestAutograd::test_graph_save_on_cpu_cuda, test/test_autograd.py::TestAutograd::test_hessian_vector, test/test_autograd.py::TestAutograd::test_hook_closure_cycle_use_custom_function_False_use_tensor_hook_False, test/test_autograd.py::TestAutograd::test_hook_closure_cycle_use_custom_function_False_use_tensor_hook_True, test/test_autograd.py::TestAutograd::test_hook_closure_cycle_use_custom_function_True_use_tensor_hook_False, test/test_autograd.py::TestAutograd::test_hook_closure_cycle_use_custom_function_True_use_tensor_hook_True, test/test_autograd.py::TestAutograd::test_hook_edge_case_when_called_with_grad, test/test_autograd.py::TestAutograd::test_hook_none, test/test_autograd.py::TestAutograd::test_hook_with_no_name, test/test_autograd.py::TestAutograd::test_hooks, test/test_autograd.py::TestAutograd::test_hooks_cpp, test/test_autograd.py::TestAutograd::test_increment_version, test/test_autograd.py::TestAutograd::test_index_backward_does_not_save_tensor, test/test_autograd.py::TestAutograd::test_indexing, test/test_autograd.py::TestAutograd::test_indexing_duplicates, test/test_autograd.py::TestAutograd::test_inplace, test/test_autograd.py::TestAutograd::test_inplace_not_requires_grad, test/test_autograd.py::TestAutograd::test_inplace_on_view_backward, test/test_autograd.py::TestAutograd::test_inplace_on_view_leaf_errors, test/test_autograd.py::TestAutograd::test_inplace_on_view_saved_output, test/test_autograd.py::TestAutograd::test_inplace_on_view_weak_grad_fn, test/test_autograd.py::TestAutograd::test_input_buffer_accum, test/test_autograd.py::TestAutograd::test_integer_outputs, test/test_autograd.py::TestAutograd::test_invalid_gradients, test/test_autograd.py::TestAutograd::test_isolated_node, test/test_autograd.py::TestAutograd::test_leaf_assignment, 
test/test_autograd.py::TestAutograd::test_legacy_function_deprecation_exception, test/test_autograd.py::TestAutograd::test_lobpcg, test/test_autograd.py::TestAutograd::test_mark_non_differentiable, test/test_autograd.py::TestAutograd::test_mark_non_differentiable_mixed, test/test_autograd.py::TestAutograd::test_mark_non_differentiable_none, test/test_autograd.py::TestAutograd::test_materialize_grads, test/test_autograd.py::TestAutograd::test_multi_backward, test/test_autograd.py::TestAutograd::test_multi_backward_no_grad, test/test_autograd.py::TestAutograd::test_multi_grad_all_hooks, test/test_autograd.py::TestAutograd::test_multi_grad_any_hooks, test/test_autograd.py::TestAutograd::test_multi_grad_hooks_invalid_mode, test/test_autograd.py::TestAutograd::test_named_tensor_for_complex_views, test/test_autograd.py::TestAutograd::test_naughty_anomaly_access, test/test_autograd.py::TestAutograd::test_naughty_autograd_function_attribute_access, test/test_autograd.py::TestAutograd::test_naughty_autograd_function_stashing_ctx, test/test_autograd.py::TestAutograd::test_nested_anomaly_detect_nan, test/test_autograd.py::TestAutograd::test_nested_anomaly_printstack_cleanup, test/test_autograd.py::TestAutograd::test_next_functions, test/test_autograd.py::TestAutograd::test_no_grad, test/test_autograd.py::TestAutograd::test_no_grad_assignment, test/test_autograd.py::TestAutograd::test_no_grad_copy, test/test_autograd.py::TestAutograd::test_no_grad_copy_sparse, test/test_autograd.py::TestAutograd::test_no_grad_input, test/test_autograd.py::TestAutograd::test_no_grad_modifies_version, test/test_autograd.py::TestAutograd::test_no_grad_python_function, test/test_autograd.py::TestAutograd::test_no_requires_grad_inplace, test/test_autograd.py::TestAutograd::test_no_unnecessary_save, test/test_autograd.py::TestAutograd::test_no_unnecessary_unwrapping, test/test_autograd.py::TestAutograd::test_not_implemented_fwad, test/test_autograd.py::TestAutograd::test_not_implemented_grad, 
test/test_autograd.py::TestAutograd::test_numpy_requires_grad, test/test_autograd.py::TestAutograd::test_once_differentiable, test/test_autograd.py::TestAutograd::test_out_variant_raises_when_inputs_require_grad, test/test_autograd.py::TestAutograd::test_pack_hook_with_inplace_modification_should_fail, test/test_autograd.py::TestAutograd::test_pickle, test/test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_e2e, test/test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_gets_cleaned_up, test/test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_multiple_hooks, test/test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_multiple_tensors, test/test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_on_non_leaf, test/test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_ordering, test/test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_returns_not_None, test/test_autograd.py::TestAutograd::test_pow_zero_tensor_gradient, test/test_autograd.py::TestAutograd::test_power_function, test/test_autograd.py::TestAutograd::test_prehook_ordering, test/test_autograd.py::TestAutograd::test_profiler, test/test_autograd.py::TestAutograd::test_profiler_aggregation_fake, test/test_autograd.py::TestAutograd::test_profiler_aggregation_lstm, test/test_autograd.py::TestAutograd::test_profiler_aggregation_table, test/test_autograd.py::TestAutograd::test_profiler_function_event_avg, test/test_autograd.py::TestAutograd::test_profiler_propagation, test/test_autograd.py::TestAutograd::test_profiler_seq_nr, test/test_autograd.py::TestAutograd::test_profiler_shapes, test/test_autograd.py::TestAutograd::test_profiler_unboxed_only, test/test_autograd.py::TestAutograd::test_pynode_destruction_deadlock, test/test_autograd.py::TestAutograd::test_record_function, test/test_autograd.py::TestAutograd::test_record_function_callbacks, test/test_autograd.py::TestAutograd::test_record_function_legacy, 
test/test_autograd.py::TestAutograd::test_record_function_multithreaded, test/test_autograd.py::TestAutograd::test_reentrant_child_error, test/test_autograd.py::TestAutograd::test_reentrant_priority, test/test_autograd.py::TestAutograd::test_reentrant_with_callbacks_both_depths, test/test_autograd.py::TestAutograd::test_reentrant_with_callbacks_depth_0, test/test_autograd.py::TestAutograd::test_reentrant_with_callbacks_depth_1, test/test_autograd.py::TestAutograd::test_reentrant_with_leaf_variable_hook, test/test_autograd.py::TestAutograd::test_reentrant_with_non_leaf_variable_hook, test/test_autograd.py::TestAutograd::test_requires_grad, test/test_autograd.py::TestAutograd::test_requires_grad_, test/test_autograd.py::TestAutograd::test_requires_grad_inplace, test/test_autograd.py::TestAutograd::test_retain_grad, test/test_autograd.py::TestAutograd::test_retain_grad_cycle, test/test_autograd.py::TestAutograd::test_retain_grad_inplace, test/test_autograd.py::TestAutograd::test_retain_grad_inplace_over_view, test/test_autograd.py::TestAutograd::test_retains_grad_can_always_observe_tensor_prehook, test/test_autograd.py::TestAutograd::test_retains_grad_inplace_multiple_outputs, test/test_autograd.py::TestAutograd::test_return_duplicate, test/test_autograd.py::TestAutograd::test_return_duplicate_inplace, test/test_autograd.py::TestAutograd::test_return_leaf, test/test_autograd.py::TestAutograd::test_return_leaf_inplace, test/test_autograd.py::TestAutograd::test_save_none_for_backward, test/test_autograd.py::TestAutograd::test_save_on_cpu_and_checkpoint, test/test_autograd.py::TestAutograd::test_save_output_nr, test/test_autograd.py::TestAutograd::test_saved_tensor_hooks_custom_error_propagaation, test/test_autograd.py::TestAutograd::test_saved_tensor_hooks_custom_function_intermediates, test/test_autograd.py::TestAutograd::test_saved_variable_packing_unpacking_did_not_save_original_with_default_hooks, 
test/test_autograd.py::TestAutograd::test_saved_variable_packing_unpacking_did_not_save_original_with_hooks, test/test_autograd.py::TestAutograd::test_saved_variable_packing_unpacking_saved_original_with_default_hooks, test/test_autograd.py::TestAutograd::test_saved_variable_packing_unpacking_saved_original_with_hooks, test/test_autograd.py::TestAutograd::test_saved_variable_saved_original_inplace_detach, test/test_autograd.py::TestAutograd::test_saved_variable_version_counter, test/test_autograd.py::TestAutograd::test_saved_variables_deprecated, test/test_autograd.py::TestAutograd::test_saving_variable_to_disk, test/test_autograd.py::TestAutograd::test_select_expanded_v, test/test_autograd.py::TestAutograd::test_select_sum, test/test_autograd.py::TestAutograd::test_set_data_preserve_pyobj, test/test_autograd.py::TestAutograd::test_set_data_self_requires_grad, test/test_autograd.py::TestAutograd::test_set_data_tensorimpl_type, test/test_autograd.py::TestAutograd::test_set_grad_coroutines, test/test_autograd.py::TestAutograd::test_set_grad_coroutines_benign_exceptions, test/test_autograd.py::TestAutograd::test_set_grad_coroutines_critical_exceptions, test/test_autograd.py::TestAutograd::test_set_grad_coroutines_exit, test/test_autograd.py::TestAutograd::test_set_grad_enabled, test/test_autograd.py::TestAutograd::test_set_grad_enabled_wraps, test/test_autograd.py::TestAutograd::test_set_grad_generator_functions, test/test_autograd.py::TestAutograd::test_set_grad_generator_functions_recursive, test/test_autograd.py::TestAutograd::test_set_materialize_non_diff_grads, test/test_autograd.py::TestAutograd::test_setitem, test/test_autograd.py::TestAutograd::test_setitem_mask, test/test_autograd.py::TestAutograd::test_setting_default_saved_variable_hooks_twice_should_not_fail, test/test_autograd.py::TestAutograd::test_setting_default_saved_variable_hooks_twice_should_use_inner, test/test_autograd.py::TestAutograd::test_setup_context_when_forward_has_default_args, 
test/test_autograd.py::TestAutograd::test_shape, test/test_autograd.py::TestAutograd::test_sharded_grad, test/test_autograd.py::TestAutograd::test_simple_reentrant, test/test_autograd.py::TestAutograd::test_slice_expanded_v, test/test_autograd.py::TestAutograd::test_sparse_gather_both_scalar, test/test_autograd.py::TestAutograd::test_sparse_gather_dim0, test/test_autograd.py::TestAutograd::test_sparse_gather_dim1, test/test_autograd.py::TestAutograd::test_sparse_gather_dim_neg, test/test_autograd.py::TestAutograd::test_sparse_gather_ind_scalar, test/test_autograd.py::TestAutograd::test_sparse_gather_x_scalar, test/test_autograd.py::TestAutograd::test_sparse_mm_backward, test/test_autograd.py::TestAutograd::test_tensor_grad_warnings, test/test_autograd.py::TestAutograd::test_tensor_hooks_inplace, test/test_autograd.py::TestAutograd::test_tensor_hooks_inplace_multiple_outputs, test/test_autograd.py::TestAutograd::test_tensor_hooks_inplace_over_view, test/test_autograd.py::TestAutograd::test_thread_shutdown, test/test_autograd.py::TestAutograd::test_to_sparse_backward, test/test_autograd.py::TestAutograd::test_too_many_grads, test/test_autograd.py::TestAutograd::test_type_conversions, test/test_autograd.py::TestAutograd::test_unrelated_inputs, test/test_autograd.py::TestAutograd::test_unsafe_set_version_counter, test/test_autograd.py::TestAutograd::test_unused_output, test/test_autograd.py::TestAutograd::test_var_mean_differentiable, test/test_autograd.py::TestAutograd::test_variable_traverse, test/test_autograd.py::TestAutograd::test_version_counter, test/test_autograd.py::TestAutograd::test_view_func_replay, test/test_autograd.py::TestAutograd::test_view_func_replay_with_modified_state, test/test_autograd.py::TestAutograd::test_view_replay_enabled, test/test_autograd.py::TestAutograd::test_volatile_deprecated, test/test_autograd.py::TestAutograd::test_will_engine_execute_node, test/test_autograd.py::TestAutograd::test_wrapped_number_saved_variable_hooks, 
test/test_autograd.py::TestAutogradForwardModeBatchedGrad::test_inplace_on_view_not_same_layout, test/test_autograd.py::TestAutogradForwardModeBatchedGrad::test_inplace_on_view_same_layout, test/test_autograd.py::TestAutogradForwardModeBatchedGrad::test_metadata_check_for_storage_numel_skipped, test/test_autograd.py::TestAutogradForwardModeBatchedGrad::test_out_of_place_basic, test/test_autograd.py::TestAutogradForwardModeBatchedGrad::test_out_of_place_not_same_layout, test/test_autograd.py::TestAutogradForwardMode::test_advanced_packing_unpacking, test/test_autograd.py::TestAutogradForwardMode::test_backward_graph_destruction, test/test_autograd.py::TestAutogradForwardMode::test_basic_packing_unpacking, test/test_autograd.py::TestAutogradForwardMode::test_codegen_ignores_undefined_outputs, test/test_autograd.py::TestAutogradForwardMode::test_create_new_zeros_with_same_meta, test/test_autograd.py::TestAutogradForwardMode::test_default_level, test/test_autograd.py::TestAutogradForwardMode::test_detach_view_tracking, test/test_autograd.py::TestAutogradForwardMode::test_forward_level_cleanup, test/test_autograd.py::TestAutogradForwardMode::test_fwd_grad_enabled, test/test_autograd.py::TestAutogradForwardMode::test_grad_cleanup, test/test_autograd.py::TestAutogradForwardMode::test_make_dual_forbid_integral_dtype, test/test_autograd.py::TestAutogradForwardMode::test_make_dual_inference_tensor_in_inference_mode, test/test_autograd.py::TestAutogradForwardMode::test_make_dual_torch_dispatch, test/test_autograd.py::TestAutogradForwardMode::test_metadata_check_check_conj, test/test_autograd.py::TestAutogradForwardMode::test_metadata_check_checks_ignores_size_zero, test/test_autograd.py::TestAutogradForwardMode::test_metadata_check_checks_storage_numel, test/test_autograd.py::TestAutogradForwardMode::test_metadata_check_ignore_storage_offset_for_zero_numel_tensor, test/test_autograd.py::TestAutogradForwardMode::test_metadata_check_when_primal_has_conj_bit, 
test/test_autograd.py::TestAutogradForwardMode::test_metadata_check_when_primal_has_neg_bit, test/test_autograd.py::TestAutogradForwardMode::test_nested_level, test/test_autograd.py::TestAutogradForwardMode::test_non_differentiable, test/test_autograd.py::TestAutogradForwardMode::test_out_variant, test/test_autograd.py::TestAutogradForwardMode::test_print, test/test_autograd.py::TestAutogradForwardMode::test_set_fw_grad_having_own_fw_grad_at_same_level, test/test_autograd.py::TestAutogradForwardMode::test_set_fwd_grad_enabled, test/test_autograd.py::TestAutogradForwardMode::test_size_check, test/test_autograd.py::TestAutogradForwardMode::test_view_inplace_always_creates_a_view, test/test_autograd.py::TestAutogradForwardMode::test_view_inplace_differentiable_views, test/test_autograd.py::TestAutogradForwardMode::test_view_inplace_non_differentiable_views, test/test_autograd.py::TestAllowMutationOnSaved::test_backward_out_of_context, test/test_autograd.py::TestAllowMutationOnSaved::test_basic, test/test_autograd.py::TestAllowMutationOnSaved::test_disallow_nesting, test/test_autograd.py::TestAllowMutationOnSaved::test_double_backward, test/test_autograd.py::TestAllowMutationOnSaved::test_save_base_and_modify_view, test/test_autograd.py::TestAllowMutationOnSaved::test_save_view_modify_base, test/test_autograd.py::TestAllowMutationOnSaved::test_saved_but_not_anymore, test/test_autograd.py::TestAllowMutationOnSaved::test_saved_same_tensor_different_versions, test/test_autograd.py::TestAllowMutationOnSaved::test_saved_same_tensor_many_times, test/test_autograd.py::TestAllowMutationOnSaved::test_views, test/test_autograd.py::TestAllowMutationOnSaved::test_with_math_views, test/test_autograd.py::TestAllowMutationOnSaved::test_with_out_variant, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_context_manager, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_decorator, 
test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_existing_autograd_session, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_handle_direct_view_on_rebase, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_handle_indirect_view_on_rebase, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_inf_tensor_in_inf_mode_functional_op, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_inf_tensor_in_inf_mode_inplace_op, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_inf_tensor_in_inf_mode_view_op, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_inf_tensor_in_normal_mode_functional_op, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_inf_tensor_in_normal_mode_inplace_op, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_inf_tensor_in_normal_mode_view_op, test/test_autograd.py::TestAutogradInferenceMode::test_inference_mode_tensor_creation, test/test_autograd.py::TestAutogradInferenceMode::test_mix_inference_and_normal_tensor_functional_op, test/test_autograd.py::TestAutogradInferenceMode::test_mix_inference_and_normal_tensor_inplace_op, test/test_autograd.py::TestAutogradInferenceMode::test_mix_inference_and_normal_tensor_view_op, test/test_autograd.py::TestAutogradInferenceMode::test_normal_tensor_inplace_output_in_inference_mode, test/test_autograd.py::TestAutogradInferenceMode::test_normal_tensor_inplace_output_in_normal_mode, test/test_autograd.py::TestAutogradInferenceMode::test_normal_tensor_view_output_in_inference_mode, test/test_autograd.py::TestAutogradInferenceMode::test_normal_tensor_view_output_in_normal_mode, test/test_autograd.py::TestMultithreadAutograd::test_cat_stack_r_to_c, test/test_autograd.py::TestMultithreadAutograd::test_custom_function_propagates_errors_from_device_thread, test/test_autograd.py::TestMultithreadAutograd::test_dataparallel_saved_tensors_hooks, 
test/test_autograd.py::TestMultithreadAutograd::test_fork_join_in_middle, test/test_autograd.py::TestMultithreadAutograd::test_multi_grad_all_hooks, test/test_autograd.py::TestMultithreadAutograd::test_multi_grad_any_hooks, test/test_autograd.py::TestMultithreadAutograd::test_multithreaded_exception_propagation, test/test_autograd.py::TestMultithreadAutograd::test_preserve_backtrace, test/test_autograd.py::TestMultithreadAutograd::test_python_thread_in_middle, test/test_autograd.py::TestMultithreadAutograd::test_set_multithreading_enabled_as_context_manager_and_function, test/test_autograd.py::TestMultithreadAutograd::test_simple_backward, test/test_autograd.py::TestMultithreadAutograd::test_simple_backward_same_input, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_early_stop_False, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_early_stop_True, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_kwargs_early_stop_False, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_kwargs_early_stop_True, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_non_tensor_inputs_and_outputs_early_stop_False, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_non_tensor_inputs_and_outputs_early_stop_True, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_reentrant_backwards_early_stop_False, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_reentrant_backwards_early_stop_True, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_same_graph_early_stop_False, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_same_graph_early_stop_True, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_set_early_stop, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_set_early_stop_no_recompution_needed, 
test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_two_children_early_stop_False, test/test_autograd.py::TestNestedCheckpoint::test_nested_checkpoint_two_children_early_stop_True, test/test_autograd.py::TestAutogradComplex::test_view_func_for_complex_views, test/test_autograd.py::TestAutogradComplex::test_view_with_multi_output, test/test_autograd.py::TestAutogradFunctional::test_construct_standard_basis_for_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_construct_standard_basis_for_cuda_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_construct_standard_basis_for_cuda_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_construct_standard_basis_for_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_create_graph_vectorize_False_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_create_graph_vectorize_False_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_create_graph_vectorize_True_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_create_graph_vectorize_True_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_err_check_strict_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_err_check_strict_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_err_check_strict_vectorize_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_err_check_strict_vectorize_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_err_check_vectorize_False_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_err_check_vectorize_False_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_err_check_vectorize_True_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_err_check_vectorize_True_logging_tensor, 
test/test_autograd.py::TestAutogradFunctional::test_hessian_match_vhp_hvp_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_match_vhp_hvp_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_no_grad_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_no_grad_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_output_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_output_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_output_vectorized_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_output_vectorized_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_scalar_vectorize_False_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_scalar_vectorize_False_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_scalar_vectorize_True_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_scalar_vectorize_True_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_vectorize_correctness_multi_input_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_vectorize_correctness_multi_input_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_vectorize_correctness_simple_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_vectorize_correctness_simple_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_vectorize_correctness_unrelated_outputs_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_vectorize_correctness_unrelated_outputs_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_vectorize_raises_no_warnings_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hessian_vectorize_raises_no_warnings_logging_tensor, 
test/test_autograd.py::TestAutogradFunctional::test_hvp_create_graph_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_create_graph_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_err_check_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_err_check_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_err_check_strict_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_err_check_strict_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_no_grad_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_no_grad_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_output_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_output_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_scalar_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_hvp_scalar_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_create_graph_vectorize_False_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_create_graph_vectorize_False_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_create_graph_vectorize_True_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_create_graph_vectorize_True_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_err_check_strict_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_err_check_strict_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_err_check_strict_vectorize_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_err_check_strict_vectorize_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_err_check_vectorize_False_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_err_check_vectorize_False_logging_tensor, 
test/test_autograd.py::TestAutogradFunctional::test_jacobian_err_check_vectorize_True_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_err_check_vectorize_True_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_match_vjp_jvp_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_match_vjp_jvp_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_no_grad_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_no_grad_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_output_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_output_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_output_vectorized_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_output_vectorized_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_scalar_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_scalar_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_scalar_vectorized_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_scalar_vectorized_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_different_devices_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_different_devices_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_different_dtype_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_different_dtype_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_multi_input_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_multi_input_logging_tensor, 
test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_multi_input_multi_output_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_multi_input_multi_output_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_simple_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_simple_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_unrelated_outputs_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_unrelated_outputs_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_zero_dim_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_correctness_zero_dim_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_raises_no_warnings_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jacobian_vectorize_raises_no_warnings_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_create_graph_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_create_graph_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_err_check_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_err_check_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_err_check_strict_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_err_check_strict_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_no_grad_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_no_grad_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_output_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_output_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_jvp_scalar_base_tensor, 
test/test_autograd.py::TestAutogradFunctional::test_jvp_scalar_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_create_graph_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_create_graph_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_err_check_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_err_check_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_err_check_strict_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_err_check_strict_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_no_grad_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_no_grad_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_output_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_output_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_scalar_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vhp_scalar_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_create_graph_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_create_graph_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_err_check_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_err_check_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_err_check_strict_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_err_check_strict_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_no_grad_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_no_grad_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_output_base_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_output_logging_tensor, test/test_autograd.py::TestAutogradFunctional::test_vjp_scalar_base_tensor, 
test/test_autograd.py::TestAutogradFunctional::test_vjp_scalar_logging_tensor, test/test_autograd.py::TestAutogradLogging::test_logging, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_advanced_indexing_backwards_large_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_advanced_indexing_backwards_memory_format_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_backward_device_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_complex_scalar_backward_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_copy__cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_copy_forward_ad_broadcasting_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_copy_forward_ad_same_layout_copies_grad_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_copy_r_to_c_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_cross_device_reentrant_autograd_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_free_unneeded_tensor_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_grad_assignment_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_gradcheck_input_output_different_device_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_multiple_output_view_of_view_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_backprop_base_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_backprop_view_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_backprop_view_of_view_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_gradcheck_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_makes_base_require_grad_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_modify_base_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_multi_output_safe_cuda, 
test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_multi_output_unsafe_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_multiple_outputs_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_non_contig_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_of_multiple_output_view_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_of_view_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_python_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_then_no_grad_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inplace_on_view_undefined_grad_output_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_inputbuffer_add_multidevice_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_min_max_median_backprops_to_all_values_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_mv_grad_stride_0_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_non_differentiable_ops_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_parameter_resize_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_pin_memory_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_pow_real_negative_base_complex_exponent_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_profiler_emit_itt_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_profiler_emit_nvtx_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_pyscalar_conversions_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_reentrant_parent_error_on_cpu_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_requires_grad_factory_cuda_float32, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_requires_grad_factory_cuda_float64, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_resize_version_bump_cuda, 
test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_rnn_backward_to_input_but_not_parameters_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_scatter_index_reduce_amin_amax_backprops_to_all_values_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_scatter_index_reduce_prod_gradgrad_error_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_set_requires_grad_only_for_floats_cuda_float16, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_set_requires_grad_only_for_floats_cuda_float32, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_set_requires_grad_only_for_floats_cuda_float64, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_set_requires_grad_only_for_floats_cuda_int16, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_set_requires_grad_only_for_floats_cuda_int32, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_set_requires_grad_only_for_floats_cuda_int64, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_set_requires_grad_only_for_floats_cuda_int8, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_simple_reentrant_cross_device_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_sparse_backward_cuda_complex128, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_sparse_backward_cuda_float64, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_sparse_ctor_getter_backward_cuda_complex128, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_sparse_ctor_getter_backward_cuda_float64, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_sparse_mask_autograd_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_strided_leaf_grad_layout_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_to_r_to_c_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_unused_output_device_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_warning_in_backward_cuda, test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_where_functional_cuda, 
test/test_autograd.py::TestAutogradDeviceTypeCUDA::test_where_scalar_cuda, test/test_autograd.py::TestAutogradMultipleDispatchCUDA::test_autograd_composite_implicit_and_dispatch_registration_cuda, test/test_autograd.py::TestAutogradMultipleDispatchCUDA::test_autograd_multiple_dispatch_registrations_cuda, test/test_autograd.py::TestAutogradMultipleDispatchCUDA::test_backward_single_threaded_cuda, test/test_autograd.py::TestAutogradMultipleDispatchCUDA::test_backward_tls_stash_cuda, test/test_autograd.py::TestAutogradMultipleDispatchCUDA::test_foward_mode_AD_cuda, test/test_autograd.py::TestAutogradMultipleDispatchCUDA::test_per_dispatch_key_input_saving_cuda, test/test_autograd.py::TestAutogradMultipleDispatchCUDA::test_set_sequence_nr_cuda, test/test_autograd.py::TestAutogradMultipleDispatchCUDA::test_view_copy_cuda
2024-04-03T11:59:58.9493040Z
2024-04-03T11:59:58.9493415Z Running test_cuda 1/1 ... [2024-04-03 11:59:58.887430]
2024-04-03T11:59:58.9495355Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_cuda.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 11:59:58.888291]
2024-04-03T12:01:13.8162723Z
2024-04-03T12:01:13.8171491Z test_cuda 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_cuda_1.1_be5818e4c77f9677_.log
2024-04-03T12:01:13.8293936Z Running 149 items in this shard: test/test_cuda.py::TestCuda::test_arithmetic_large_tensor, test/test_cuda.py::TestCuda::test_autocast_banned, test/test_cuda.py::TestCuda::test_autocast_cache_leak, test/test_cuda.py::TestCuda::test_autocast_cat_jit, test/test_cuda.py::TestCuda::test_autocast_checkpointing, test/test_cuda.py::TestCuda::test_autocast_custom_cast_inputs, test/test_cuda.py::TestCuda::test_autocast_custom_enabled, test/test_cuda.py::TestCuda::test_autocast_ignored_types, test/test_cuda.py::TestCuda::test_autocast_linalg_fp16, test/test_cuda.py::TestCuda::test_autocast_methods_expect_builtin_promote, test/test_cuda.py::TestCuda::test_autocast_methods_fp16, test/test_cuda.py::TestCuda::test_autocast_methods_fp32, test/test_cuda.py::TestCuda::test_autocast_nn_bf16, test/test_cuda.py::TestCuda::test_autocast_nn_fp16, test/test_cuda.py::TestCuda::test_autocast_nn_fp32, test/test_cuda.py::TestCuda::test_autocast_rnn, test/test_cuda.py::TestCuda::test_autocast_torch_bf16, test/test_cuda.py::TestCuda::test_autocast_torch_expect_builtin_promote, test/test_cuda.py::TestCuda::test_autocast_torch_fp16, test/test_cuda.py::TestCuda::test_autocast_torch_fp32, test/test_cuda.py::TestCuda::test_autocast_torch_need_autocast_promote, test/test_cuda.py::TestCuda::test_batch_norm_gather_stats, test/test_cuda.py::TestCuda::test_bincount_ext, test/test_cuda.py::TestCuda::test_caching_allocator_record_stream_oom, test/test_cuda.py::TestCuda::test_caching_pinned_memory, test/test_cuda.py::TestCuda::test_check_error, test/test_cuda.py::TestCuda::test_copy_non_blocking, test/test_cuda.py::TestCuda::test_copy_non_blocking_type_conversion, test/test_cuda.py::TestCuda::test_cublas_allow_bf16_reduced_precision_reduction_get_set, 
test/test_cuda.py::TestCuda::test_cublas_allow_fp16_reduced_precision_reduction_get_set, test/test_cuda.py::TestCuda::test_cublas_allow_tf32_get_set, test/test_cuda.py::TestCuda::test_cublas_multiple_threads_same_device, test/test_cuda.py::TestCuda::test_cublas_workspace_explicit_allocation, test/test_cuda.py::TestCuda::test_cuda_get_device_capability, test/test_cuda.py::TestCuda::test_cuda_get_device_name, test/test_cuda.py::TestCuda::test_cuda_graph_allocator_propagates_stream, test/test_cuda.py::TestCuda::test_cuda_graph_error_options, test/test_cuda.py::TestCuda::test_cuda_kernel_loop_overflow, test/test_cuda.py::TestCuda::test_cuda_kernel_loop_overflow_large, test/test_cuda.py::TestCuda::test_cuda_memory_leak_detection_propagates_errors, test/test_cuda.py::TestCuda::test_cudart_register, test/test_cuda.py::TestCuda::test_cudnn_allow_tf32_get_set, test/test_cuda.py::TestCuda::test_cudnn_multiple_threads_same_device, test/test_cuda.py::TestCuda::test_cusparse_multiple_threads_same_device, test/test_cuda.py::TestCuda::test_device_count_not_cached_pre_init, test/test_cuda.py::TestCuda::test_events, test/test_cuda.py::TestCuda::test_fixed_cuda_assert_async, test/test_cuda.py::TestCuda::test_float32_matmul_precision_get_set, test/test_cuda.py::TestCuda::test_gather_bool, test/test_cuda.py::TestCuda::test_get_device_index, test/test_cuda.py::TestCuda::test_grad_scaling_autocast_fused_optimizers, test/test_cuda.py::TestCuda::test_graph_capture_oom, test/test_cuda.py::TestCuda::test_graph_capture_reset_recapture, test/test_cuda.py::TestCuda::test_graph_capture_simple, test/test_cuda.py::TestCuda::test_graph_concurrent_replay, test/test_cuda.py::TestCuda::test_graph_cudnn_dropout, test/test_cuda.py::TestCuda::test_graph_error, test/test_cuda.py::TestCuda::test_graph_grad_scaling, test/test_cuda.py::TestCuda::test_graph_is_current_stream_capturing, test/test_cuda.py::TestCuda::test_graph_make_graphed_callables_same_pool, 
test/test_cuda.py::TestCuda::test_graph_make_graphed_callables_with_amp_cache_disabled_allow_unused_input, test/test_cuda.py::TestCuda::test_graph_make_graphed_callables_with_amp_cache_enabled_allow_unused_input, test/test_cuda.py::TestCuda::test_graph_make_graphed_callables_without_amp_allow_unused_input, test/test_cuda.py::TestCuda::test_graph_make_graphed_callables_without_amp_not_allow_unused_input, test/test_cuda.py::TestCuda::test_graph_memory_stats_and_use_result_after_destroy_graph, test/test_cuda.py::TestCuda::test_graph_optims, test/test_cuda.py::TestCuda::test_graph_optims_with_explicitly_capturable_param_groups, test/test_cuda.py::TestCuda::test_graph_record_stream, test/test_cuda.py::TestCuda::test_graph_rng_distributions, test/test_cuda.py::TestCuda::test_graph_rng_functional, test/test_cuda.py::TestCuda::test_graph_scaling_fused_optimizers, test/test_cuda.py::TestCuda::test_graph_three_successive, test/test_cuda.py::TestCuda::test_graph_two_successive, test/test_cuda.py::TestCuda::test_graph_warn_if_has_zero_nodes, test/test_cuda.py::TestCuda::test_graphsafe_set_get_rng_state, test/test_cuda.py::TestCuda::test_huge_index, test/test_cuda.py::TestCuda::test_index_out_of_bounds_exception_cuda, test/test_cuda.py::TestCuda::test_lazy_init, test/test_cuda.py::TestCuda::test_manual_seed, test/test_cuda.py::TestCuda::test_matmul_device_mismatch, test/test_cuda.py::TestCuda::test_matmul_memory_use, test/test_cuda.py::TestCuda::test_max_large_axis, test/test_cuda.py::TestCuda::test_mean_fp16, test/test_cuda.py::TestCuda::test_memory_allocation, test/test_cuda.py::TestCuda::test_memory_stats_of_multiple_generators_and_graphs, test/test_cuda.py::TestCuda::test_min_max_inits, test/test_cuda.py::TestCuda::test_multinomial_ext, test/test_cuda.py::TestCuda::test_multinomial_invalid_probs_cuda, test/test_cuda.py::TestCuda::test_noncontiguous_pinned_memory, test/test_cuda.py::TestCuda::test_norm_type_conversion, test/test_cuda.py::TestCuda::test_nvtx, 
test/test_cuda.py::TestCuda::test_out_of_memory, test/test_cuda.py::TestCuda::test_out_of_memory_retry, test/test_cuda.py::TestCuda::test_pinned_memory_with_cudaregister, test/test_cuda.py::TestCuda::test_pinned_memory_with_cudaregister_multithread, test/test_cuda.py::TestCuda::test_prod_large, test/test_cuda.py::TestCuda::test_record_stream, test/test_cuda.py::TestCuda::test_record_stream_on_shifted_view, test/test_cuda.py::TestCuda::test_reduction_gpu_memory_accessing, test/test_cuda.py::TestCuda::test_repeat_graph_capture_cublas_workspace_memory, test/test_cuda.py::TestCuda::test_rocm_backward_pass_guard, test/test_cuda.py::TestCuda::test_serialization_array_with_empty, test/test_cuda.py::TestCuda::test_serialization_array_with_storage, test/test_cuda.py::TestCuda::test_set_per_process_memory_fraction, test/test_cuda.py::TestCuda::test_specify_improper_device_name, test/test_cuda.py::TestCuda::test_stream_event_repr, test/test_cuda.py::TestCuda::test_streaming_backwards_callback, test/test_cuda.py::TestCuda::test_streaming_backwards_multiple_streams, test/test_cuda.py::TestCuda::test_streaming_backwards_sync, test/test_cuda.py::TestCuda::test_streaming_backwards_sync_graph_root, test/test_cuda.py::TestCuda::test_streams, test/test_cuda.py::TestCuda::test_sum_fp16, test/test_cuda.py::TestCuda::test_tiny_half_norm_, test/test_cuda.py::TestCuda::test_to_cpu_blocking_by_default, test/test_cuda.py::TestCuda::test_to_non_blocking, test/test_cuda.py::TestCuda::test_to_numpy, test/test_cuda.py::TestCuda::test_torch_manual_seed_seeds_cuda_devices, test/test_cuda.py::TestCuda::test_type_conversions, test/test_cuda.py::TestCudaMallocAsync::test_allocator_fuzz, test/test_cuda.py::TestCudaMallocAsync::test_allocator_settings, test/test_cuda.py::TestCudaMallocAsync::test_clock_speed, test/test_cuda.py::TestCudaMallocAsync::test_cpp_memory_snapshot_pickle, test/test_cuda.py::TestCudaMallocAsync::test_cycles, test/test_cuda.py::TestCudaMallocAsync::test_direct_traceback, 
test/test_cuda.py::TestCudaMallocAsync::test_memory_plots, test/test_cuda.py::TestCudaMallocAsync::test_memory_plots_free_segment_stack, test/test_cuda.py::TestCudaMallocAsync::test_memory_plots_free_stack, test/test_cuda.py::TestCudaMallocAsync::test_memory_plots_history_context, test/test_cuda.py::TestCudaMallocAsync::test_memory_profiler_viz, test/test_cuda.py::TestCudaMallocAsync::test_memory_snapshot, test/test_cuda.py::TestCudaMallocAsync::test_memory_snapshot_script, test/test_cuda.py::TestCudaMallocAsync::test_memory_snapshot_with_cpp, test/test_cuda.py::TestCudaMallocAsync::test_notifies_oom, test/test_cuda.py::TestCudaMallocAsync::test_nvml_get_handler, test/test_cuda.py::TestCudaMallocAsync::test_power_draw, test/test_cuda.py::TestCudaMallocAsync::test_raises_oom_max_split_size_mb_setting_False, test/test_cuda.py::TestCudaMallocAsync::test_raises_oom_max_split_size_mb_setting_True, test/test_cuda.py::TestCudaMallocAsync::test_temperature, test/test_cuda.py::TestBlockStateAbsorption::test_additional_free_following_checkpoint, test/test_cuda.py::TestBlockStateAbsorption::test_allocate_in_thread_to_pool, test/test_cuda.py::TestBlockStateAbsorption::test_allocated_in_middle_of_segment, test/test_cuda.py::TestBlockStateAbsorption::test_assigning_back_deleter_fns_to_tensor, test/test_cuda.py::TestBlockStateAbsorption::test_check_pool_live_allocations, test/test_cuda.py::TestBlockStateAbsorption::test_middle_allocations_contiguous, test/test_cuda.py::TestBlockStateAbsorption::test_multiple_middle_allocations, test/test_cuda.py::TestBlockStateAbsorption::test_no_triton_on_import, test/test_cuda.py::TestBlockStateAbsorption::test_resnet, test/test_cuda.py::TestBlockStateAbsorption::test_simple, test/test_cuda.py::TestBlockStateAbsorption::test_tensor_dies_after_checkpoint
2024-04-03T12:01:13.8347252Z
2024-04-03T12:01:13.8347621Z Running test_torch 1/1 ... [2024-04-03 12:01:13.816946]
2024-04-03T12:01:13.8349576Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_torch.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:01:13.817852]
2024-04-03T12:03:59.1632928Z
2024-04-03T12:03:59.1640328Z test_torch 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_torch_1.1_88a66d12edd14a59_.log
2024-04-03T12:03:59.2171473Z Running 1031 items in this shard: test/test_torch.py::TestBasicVitalSigns::test_basic_vitals, test/test_torch.py::TestBasicVitalSigns::test_basic_vitals_read_write, test/test_torch.py::TestBasicVitalSigns::test_dataloader_vitals, test/test_torch.py::TestTorch::test_RNGState, test/test_torch.py::TestTorch::test_RNGStateAliasing, test/test_torch.py::TestTorch::test_RNG_after_pickle, test/test_torch.py::TestTorch::test_Size, test/test_torch.py::TestTorch::test_Size_iter, test/test_torch.py::TestTorch::test_Size_scalar, test/test_torch.py::TestTorch::test_add_meta_scalar, test/test_torch.py::TestTorch::test_allow_tensor_metadata_change, test/test_torch.py::TestTorch::test_apply, test/test_torch.py::TestTorch::test_as_subclass, test/test_torch.py::TestTorch::test_assert_async, test/test_torch.py::TestTorch::test_backward_hooks_traverse, test/test_torch.py::TestTorch::test_batch_norm_cpu_inference, test/test_torch.py::TestTorch::test_bmm_multithreaded, test/test_torch.py::TestTorch::test_boxMullerState, test/test_torch.py::TestTorch::test_c10_layer_norm, test/test_torch.py::TestTorch::test_cat_neg_dim, test/test_torch.py::TestTorch::test_check, test/test_torch.py::TestTorch::test_chunk_neg_dim, test/test_torch.py::TestTorch::test_conj_neg_tolist, test/test_torch.py::TestTorch::test_contains, test/test_torch.py::TestTorch::test_copy_broadcast, test/test_torch.py::TestTorch::test_copy_dtypes, test/test_torch.py::TestTorch::test_copy_float16, 
test/test_torch.py::TestTorch::test_copy_many_to_one, test/test_torch.py::TestTorch::test_copy_transpose, test/test_torch.py::TestTorch::test_cuda_not_built, test/test_torch.py::TestTorch::test_cummax_neg_dim, test/test_torch.py::TestTorch::test_cummin_neg_dim, test/test_torch.py::TestTorch::test_cumprod_neg_dim, test/test_torch.py::TestTorch::test_cumsum_neg_dim, test/test_torch.py::TestTorch::test_cxx_flags, test/test_torch.py::TestTorch::test_data_ptr_of_empty_tensor_with_storage, test/test_torch.py::TestTorch::test_data_ptr_of_empty_view_with_storage, test/test_torch.py::TestTorch::test_deepcopy_gradient, test/test_torch.py::TestTorch::test_deepcopy_parameter, test/test_torch.py::TestTorch::test_deterministic_fill_uninitialized_memory, test/test_torch.py::TestTorch::test_deterministic_flag, test/test_torch.py::TestTorch::test_device, test/test_torch.py::TestTorch::test_dim_order, test/test_torch.py::TestTorch::test_dir, test/test_torch.py::TestTorch::test_doc, test/test_torch.py::TestTorch::test_doc_template, test/test_torch.py::TestTorch::test_dot_data_use, test/test_torch.py::TestTorch::test_dtype_is_signed, test/test_torch.py::TestTorch::test_element_size, test/test_torch.py::TestTorch::test_empty_meta, test/test_torch.py::TestTorch::test_empty_storage_view, test/test_torch.py::TestTorch::test_equal, test/test_torch.py::TestTorch::test_error_msg_type_translation, test/test_torch.py::TestTorch::test_fill_diagonal, test/test_torch.py::TestTorch::test_format_scalar_meta, test/test_torch.py::TestTorch::test_from_buffer, test/test_torch.py::TestTorch::test_from_file, test/test_torch.py::TestTorch::test_gather_neg_dim, test/test_torch.py::TestTorch::test_generator_cpu, test/test_torch.py::TestTorch::test_get_cpu_capability, test/test_torch.py::TestTorch::test_has_internal_overlap, test/test_torch.py::TestTorch::test_has_storage, test/test_torch.py::TestTorch::test_index_add, test/test_torch.py::TestTorch::test_index_add_all_dtypes, 
test/test_torch.py::TestTorch::test_index_add_cornercase, test/test_torch.py::TestTorch::test_index_add_correctness, test/test_torch.py::TestTorch::test_index_add_neg_dim, test/test_torch.py::TestTorch::test_index_copy_neg_dim, test/test_torch.py::TestTorch::test_index_fill_neg_dim, test/test_torch.py::TestTorch::test_index_select_neg_dim, test/test_torch.py::TestTorch::test_invalid_arg_error_handling, test/test_torch.py::TestTorch::test_invalid_generator_raises, test/test_torch.py::TestTorch::test_is_nonzero, test/test_torch.py::TestTorch::test_is_same_size, test/test_torch.py::TestTorch::test_iter, test/test_torch.py::TestTorch::test_kthvalue_neg_dim, test/test_torch.py::TestTorch::test_linspace_logspace, test/test_torch.py::TestTorch::test_logcumsumexp_neg_dim, test/test_torch.py::TestTorch::test_manual_seed, test/test_torch.py::TestTorch::test_map, test/test_torch.py::TestTorch::test_map2, test/test_torch.py::TestTorch::test_max_neg_dim, test/test_torch.py::TestTorch::test_mean_neg_dim, test/test_torch.py::TestTorch::test_median_neg_dim, test/test_torch.py::TestTorch::test_memory_format, test/test_torch.py::TestTorch::test_memory_format_contiguous_returns_same_tensor_if_already_satisfies, test/test_torch.py::TestTorch::test_memory_format_empty, test/test_torch.py::TestTorch::test_min_neg_dim, test/test_torch.py::TestTorch::test_mode_neg_dim, test/test_torch.py::TestTorch::test_multinomial_invalid_probs, test/test_torch.py::TestTorch::test_nanmedian_neg_dim, test/test_torch.py::TestTorch::test_narrow_neg_dim, test/test_torch.py::TestTorch::test_nbytes, test/test_torch.py::TestTorch::test_ndim, test/test_torch.py::TestTorch::test_new, test/test_torch.py::TestTorch::test_newaxis_numpy_comparison, test/test_torch.py::TestTorch::test_newindex, test/test_torch.py::TestTorch::test_no_cuda_monkeypatch, test/test_torch.py::TestTorch::test_norm_neg_dim, test/test_torch.py::TestTorch::test_normal_shape, test/test_torch.py::TestTorch::test_numel, 
test/test_torch.py::TestTorch::test_parallel_info, test/test_torch.py::TestTorch::test_parsing_double, test/test_torch.py::TestTorch::test_parsing_int64, test/test_torch.py::TestTorch::test_parsing_intlist, test/test_torch.py::TestTorch::test_permute, test/test_torch.py::TestTorch::test_pickle, test/test_torch.py::TestTorch::test_pickle_dtype, test/test_torch.py::TestTorch::test_pickle_function, test/test_torch.py::TestTorch::test_pickle_parameter, test/test_torch.py::TestTorch::test_pickle_parameter_no_requires_grad, test/test_torch.py::TestTorch::test_pickle_size, test/test_torch.py::TestTorch::test_pin_memory, test/test_torch.py::TestTorch::test_print, test/test_torch.py::TestTorch::test_prod_neg_dim, test/test_torch.py::TestTorch::test_pyobj_preserved, test/test_torch.py::TestTorch::test_qengine, test/test_torch.py::TestTorch::test_renorm_neg_dim, test/test_torch.py::TestTorch::test_resizable, test/test_torch.py::TestTorch::test_reversed, test/test_torch.py::TestTorch::test_scatter_neg_dim, test/test_torch.py::TestTorch::test_select_neg_dim, test/test_torch.py::TestTorch::test_set_flush_denormal, test/test_torch.py::TestTorch::test_setting_real_imag_to_a_number, test/test_torch.py::TestTorch::test_show_config, test/test_torch.py::TestTorch::test_size_neg_dim, test/test_torch.py::TestTorch::test_size_stride, test/test_torch.py::TestTorch::test_sizeof, test/test_torch.py::TestTorch::test_slice, test/test_torch.py::TestTorch::test_slow_test, test/test_torch.py::TestTorch::test_sobolengine_bounds, test/test_torch.py::TestTorch::test_sobolengine_bounds_scrambled, test/test_torch.py::TestTorch::test_sobolengine_continuing, test/test_torch.py::TestTorch::test_sobolengine_continuing_scrambled, test/test_torch.py::TestTorch::test_sobolengine_distribution, test/test_torch.py::TestTorch::test_sobolengine_distribution_scrambled, test/test_torch.py::TestTorch::test_sobolengine_draw, test/test_torch.py::TestTorch::test_sobolengine_draw_base2, 
test/test_torch.py::TestTorch::test_sobolengine_draw_base2_scrambled, test/test_torch.py::TestTorch::test_sobolengine_draw_scrambled, test/test_torch.py::TestTorch::test_sobolengine_fast_forward, test/test_torch.py::TestTorch::test_sobolengine_fast_forward_scrambled, test/test_torch.py::TestTorch::test_sobolengine_first_point, test/test_torch.py::TestTorch::test_sobolengine_high_dim, test/test_torch.py::TestTorch::test_sobolengine_raise, test/test_torch.py::TestTorch::test_sobolengine_reset, test/test_torch.py::TestTorch::test_sobolengine_reset_scrambled, test/test_torch.py::TestTorch::test_sort_neg_dim, test/test_torch.py::TestTorch::test_split_neg_dim, test/test_torch.py::TestTorch::test_split_with_sizes_copy_out, test/test_torch.py::TestTorch::test_squeeze_neg_dim, test/test_torch.py::TestTorch::test_std_neg_dim, test/test_torch.py::TestTorch::test_storage_base_init, test/test_torch.py::TestTorch::test_storage_base_new, test/test_torch.py::TestTorch::test_storage_byteswap, test/test_torch.py::TestTorch::test_storage_casts, test/test_torch.py::TestTorch::test_storage_cycle_via_dict, test/test_torch.py::TestTorch::test_storage_cycle_via_slots, test/test_torch.py::TestTorch::test_storage_dead_weak_ref, test/test_torch.py::TestTorch::test_storage_dealloc, test/test_torch.py::TestTorch::test_storage_dealloc_resurrected, test/test_torch.py::TestTorch::test_storage_dealloc_subclass_resurrected, test/test_torch.py::TestTorch::test_storage_dealloc_subclass_zombie, test/test_torch.py::TestTorch::test_storage_dict_dealloc, test/test_torch.py::TestTorch::test_storage_error, test/test_torch.py::TestTorch::test_storage_error_no_attribute, test/test_torch.py::TestTorch::test_storage_finalizer_dealloc, test/test_torch.py::TestTorch::test_storage_fix_weakref_no_leak, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc_resurrected, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc_zombie, 
test/test_torch.py::TestTorch::test_storage_preserve_nonhermetic_in_hermetic_context, test/test_torch.py::TestTorch::test_storage_resurrected_weak_ref, test/test_torch.py::TestTorch::test_storage_slot_dealloc, test/test_torch.py::TestTorch::test_storage_weakref_dealloc, test/test_torch.py::TestTorch::test_structseq_repr, test/test_torch.py::TestTorch::test_subclass_preserved, test/test_torch.py::TestTorch::test_subclass_tensors, test/test_torch.py::TestTorch::test_sum_neg_dim, test/test_torch.py::TestTorch::test_swap_basic, test/test_torch.py::TestTorch::test_swap_fail_slots, test/test_torch.py::TestTorch::test_t_not_2d_error, test/test_torch.py::TestTorch::test_tensor_base_init, test/test_torch.py::TestTorch::test_tensor_base_new, test/test_torch.py::TestTorch::test_tensor_ctor_scalar, test/test_torch.py::TestTorch::test_tensor_cycle_via_dict, test/test_torch.py::TestTorch::test_tensor_cycle_via_slots, test/test_torch.py::TestTorch::test_tensor_dead_weak_ref, test/test_torch.py::TestTorch::test_tensor_dict_dealloc, test/test_torch.py::TestTorch::test_tensor_finalizer_dealloc, test/test_torch.py::TestTorch::test_tensor_fix_weakref_no_leak, test/test_torch.py::TestTorch::test_tensor_resurrected_weak_ref, test/test_torch.py::TestTorch::test_tensor_set, test/test_torch.py::TestTorch::test_tensor_set_errors, test/test_torch.py::TestTorch::test_tensor_slot_dealloc, test/test_torch.py::TestTorch::test_tensor_weakref_dealloc, test/test_torch.py::TestTorch::test_tensor_where_scalar, test/test_torch.py::TestTorch::test_tensoriterator_output_setup, test/test_torch.py::TestTorch::test_terminate_handler_on_crash, test/test_torch.py::TestTorch::test_to, test/test_torch.py::TestTorch::test_to_with_tensor, test/test_torch.py::TestTorch::test_topk_neg_dim, test/test_torch.py::TestTorch::test_torch_from_file, test/test_torch.py::TestTorch::test_transpose_neg_dim, test/test_torch.py::TestTorch::test_type, test/test_torch.py::TestTorch::test_type_alias, 
test/test_torch.py::TestTorch::test_type_conversion_via_dtype_name, test/test_torch.py::TestTorch::test_typed_storage_deprecation_warning, test/test_torch.py::TestTorch::test_typed_storage_internal_no_warning, test/test_torch.py::TestTorch::test_unbind_neg_dim, test/test_torch.py::TestTorch::test_unflatten, test/test_torch.py::TestTorch::test_unfold_neg_dim, test/test_torch.py::TestTorch::test_unsqueeze_neg_dim, test/test_torch.py::TestTorch::test_upsample_nearest1d_meta, test/test_torch.py::TestTorch::test_upsample_nearest2d_meta, test/test_torch.py::TestTorch::test_var_neg_dim, test/test_torch.py::TestTorch::test_warn_types, test/test_torch.py::TestTorch::test_wildcard_import, test/test_torch.py::TestVitalSignsCudaCUDA::test_cuda_vitals_gpu_only_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_int32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_assertRaisesRegex_ignore_msg_non_native_device_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bfloat16_neg_abs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bool_tensor_value_change_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_add_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_addcdiv_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_addcmul_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_atan2_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_copy_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_dist_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_div_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_eq_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_fmod_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_ge_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_gt_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_le_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_lerp_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_lt_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_map2_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_map_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_fill_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_scatter_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_select_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_max_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_min_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_mul_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_ne_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_pow_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_remainder_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_sub_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_complex64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_no_inf_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_no_inf_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_cuda_backward_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_euclidean_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_grad_p_lt_1_no_nan_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_large_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_non_contiguous_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_non_contiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_norm_batch_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_norm_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_same_inputs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_check_tensor_all_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_check_tensor_internal_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_all_dtypes_and_devices_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_not_memory_dense_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_zero_stride_dim_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_complex_half_experimental_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_constants_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_conv_transposed_backward_agnostic_to_memory_format_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_conv_transposed_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_all_dtypes_and_devices_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_math_view_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_mem_overlap_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cov_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cpp_warnings_have_python_context_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cublas_config_nondeterministic_alert_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummax_cummin_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummax_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummin_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumprod_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_replication_pad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint16, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_device_guard_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_uint8, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_dim_function_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_discontiguous_out_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dist_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dtypetensor_warnings_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_errors_index_copy_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_expected_failure_xla_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_no_zero_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_no_zero_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gather_backward_deterministic_path_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gather_backward_one_dim_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scale_will_not_overflow_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaler_pass_itself_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_accumulation_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_fused_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_clipping_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_clipping_separate_unscale_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_multiple_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_penalty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_state_dict_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_sparse_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_update_scale_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_int64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_type_promotion_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_hook_remove_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_add_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_add_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_complex64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_put_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_put_non_accumulate_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_float16, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_int32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_uint8, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_invalid_shapes_grid_sampler_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_is_set_to_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_is_signed_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_large_cumprod_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_large_cumsum_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float16, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float16, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_logcumsumexp_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lognormal_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_bool_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bfloat16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bfloat16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bool_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bool_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex128_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex128_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float32_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float32_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int32_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int32_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int8_bool, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int8_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_uint8_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_uint8_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_bool_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_inplace_noncontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_large_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_clone_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_consistency_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_cpu_and_cuda_ops_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_empty_like_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_factory_like_functions_preserve_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_operators_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_preserved_after_permute_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_propagation_rules_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_to_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_type_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_type_shortcuts_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_module_share_memory_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_device_constrain_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_empty_w_replacement_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_empty_wo_replacement_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_gpu_device_constrain_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_rng_state_advance_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_narrow_copy_non_contiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_narrow_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveAvgPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveAvgPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveMaxPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AvgPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_CTCLoss_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_EmbeddingBag_max_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_FractionalMaxPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_FractionalMaxPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_NLLLoss_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad1d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad1d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_bincount_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_float64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_grid_sample_2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_grid_sample_3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_histc_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_bicubic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_linear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_trilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_kthvalue_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_median_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_put_accumulate_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_put_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_qint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_qint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint2x4, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint4x2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint8, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nullary_op_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pairwise_distance_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pdist_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pdist_norm_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pickle_gradscaler_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_pin_memory_from_constructor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_reduced_type_float_copy_cuda_bfloat16, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_reduced_type_float_copy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_repeat_interleave_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scalar_check_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_bool_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_non_unique_index_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_one_dim_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_to_large_input_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_bool_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_multiply_unsupported_dtypes_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_multiply_unsupported_dtypes_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int8, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_to_large_input_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_zero_size_index_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_serialization_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_default_tensor_type_warnings_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_shift_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_skip_xla_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_all_devices_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_complex128, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_bool, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_complex128, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_qint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_qint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_quint4x2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_quint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_strides_propagation_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_sync_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_empty_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_set_errors_multigpu_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_shape_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_type_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_ternary_op_mem_overlap_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_unfold_all_devices_and_dtypes_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_unfold_scalars_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_untyped_storage_meta_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_warn_always_caught_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_where_scalar_handcrafted_values_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_advancedindex_mixed_cpu_devices_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_advancedindex_mixed_devices_error_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_float32, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_float64, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_int64, test/test_torch.py::TestDevicePrecisionCUDA::test_copy_broadcast_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_copy_noncontig_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_cuda_device_idx_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_device_serialization_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float16, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float32, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float64, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int16, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int32, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int64, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int8, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_uint8, test/test_torch.py::TestDevicePrecisionCUDA::test_index_add_bfloat16_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_multidevice_serialization_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_type_conversions_same_device_cuda 2024-04-03T12:03:59.2623703Z 2024-04-03T12:03:59.2624295Z Running test_nn 1/1 ... 
[2024-04-03 12:03:59.168064]
2024-04-03T12:03:59.2626258Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_nn.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:03:59.169129]
2024-04-03T12:10:56.4998803Z
2024-04-03T12:10:56.5007350Z test_nn 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_nn_1.1_fd4c49d4ed7f525b_.log
2024-04-03T12:10:56.6995274Z Running 2186 items in this shard: test/test_nn.py::TestNN::test_AdaptiveLogSoftmax, test/test_nn.py::TestNN::test_AdaptiveLogSoftmax_cuda, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_reduce, test/test_nn.py::TestNN::test_BCELoss_no_reduce_cuda, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_scalar, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_legacy_enum, 
test/test_nn.py::TestNN::test_BCEWithLogitsLoss_legacy_enum_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_scalar, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_CELU_no_batch_dim, test/test_nn.py::TestNN::test_CELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_CTCLoss_critical_target_len, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cpu, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cuda, test/test_nn.py::TestNN::test_CTCLoss_long_targets, test/test_nn.py::TestNN::test_CTCLoss_typechecks, test/test_nn.py::TestNN::test_CTCLoss_zero_infinity, test/test_nn.py::TestNN::test_Conv1d, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_cuda, test/test_nn.py::TestNN::test_Conv1d_dilated, test/test_nn.py::TestNN::test_Conv1d_dilated_cuda, 
test/test_nn.py::TestNN::test_Conv1d_groups, test/test_nn.py::TestNN::test_Conv1d_groups_cuda, test/test_nn.py::TestNN::test_Conv1d_pad1, test/test_nn.py::TestNN::test_Conv1d_pad1_cuda, test/test_nn.py::TestNN::test_Conv1d_pad1size1, test/test_nn.py::TestNN::test_Conv1d_pad1size1_cuda, test/test_nn.py::TestNN::test_Conv1d_pad2, test/test_nn.py::TestNN::test_Conv1d_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_pad2size1, test/test_nn.py::TestNN::test_Conv1d_pad2size1_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_same, test/test_nn.py::TestNN::test_Conv1d_pad_same2, test/test_nn.py::TestNN::test_Conv1d_pad_same2_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_same_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_valid, test/test_nn.py::TestNN::test_Conv1d_pad_valid_cuda, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_stride, test/test_nn.py::TestNN::test_Conv1d_stride_cuda, test/test_nn.py::TestNN::test_Conv1d_zero_batch, test/test_nn.py::TestNN::test_Conv1d_zero_batch_cuda, test/test_nn.py::TestNN::test_Conv1d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_zeros_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise, test/test_nn.py::TestNN::test_Conv2d_depthwise_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded, 
test/test_nn.py::TestNN::test_Conv2d_depthwise_padded_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier_cuda, test/test_nn.py::TestNN::test_Conv2d_dilated, test/test_nn.py::TestNN::test_Conv2d_dilated_cuda, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_groups, test/test_nn.py::TestNN::test_Conv2d_groups_cuda, test/test_nn.py::TestNN::test_Conv2d_groups_thnn, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_cuda, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_groups_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_groups_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_no_bias, test/test_nn.py::TestNN::test_Conv2d_no_bias_cuda, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_pad_same, test/test_nn.py::TestNN::test_Conv2d_pad_same_cuda, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated_cuda, test/test_nn.py::TestNN::test_Conv2d_pad_valid, test/test_nn.py::TestNN::test_Conv2d_pad_valid_cuda, test/test_nn.py::TestNN::test_Conv2d_padding, test/test_nn.py::TestNN::test_Conv2d_padding_cuda, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_reflect_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_reflect_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2, 
test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d_strided, test/test_nn.py::TestNN::test_Conv2d_strided_cuda, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_zero_batch, test/test_nn.py::TestNN::test_Conv2d_zero_batch_cuda, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv3d, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_cuda, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv3d_cuda, test/test_nn.py::TestNN::test_Conv3d_dilated, test/test_nn.py::TestNN::test_Conv3d_dilated_cuda, test/test_nn.py::TestNN::test_Conv3d_dilated_strided, test/test_nn.py::TestNN::test_Conv3d_dilated_strided_cuda, test/test_nn.py::TestNN::test_Conv3d_groups, test/test_nn.py::TestNN::test_Conv3d_groups_cuda, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_no_bias, test/test_nn.py::TestNN::test_Conv3d_no_bias_cuda, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_pad_same, test/test_nn.py::TestNN::test_Conv3d_pad_same_cuda, 
test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated_cuda, test/test_nn.py::TestNN::test_Conv3d_pad_valid, test/test_nn.py::TestNN::test_Conv3d_pad_valid_cuda, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv3d_stride, test/test_nn.py::TestNN::test_Conv3d_stride_cuda, test/test_nn.py::TestNN::test_Conv3d_stride_padding, test/test_nn.py::TestNN::test_Conv3d_stride_padding_cuda, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_zero_batch, test/test_nn.py::TestNN::test_Conv3d_zero_batch_cuda, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d, test/test_nn.py::TestNN::test_ConvTranspose1d_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d_groups, test/test_nn.py::TestNN::test_ConvTranspose1d_groups_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d, test/test_nn.py::TestNN::test_ConvTranspose2d_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_cuda, 
test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_groups, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose3d, test/test_nn.py::TestNN::test_ConvTranspose3d_cuda, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated_cuda, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_CrossMapLRN2d, 
test/test_nn.py::TestNN::test_CrossMapLRN2d_cuda, test/test_nn.py::TestNN::test_ELU_no_batch_dim, test/test_nn.py::TestNN::test_ELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Embedding, test/test_nn.py::TestNN::test_EmbeddingBag_discontiguous, test/test_nn.py::TestNN::test_EmbeddingBag_discontiguous_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_max, test/test_nn.py::TestNN::test_EmbeddingBag_max_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_mean, test/test_nn.py::TestNN::test_EmbeddingBag_mean_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_mean_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_mean_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sparse, test/test_nn.py::TestNN::test_EmbeddingBag_sparse_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum, test/test_nn.py::TestNN::test_EmbeddingBag_sum_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx_cuda, test/test_nn.py::TestNN::test_Embedding_cuda, test/test_nn.py::TestNN::test_Embedding_discontiguous, test/test_nn.py::TestNN::test_Embedding_discontiguous_cuda, test/test_nn.py::TestNN::test_Embedding_sparse, test/test_nn.py::TestNN::test_Embedding_sparse_cuda, test/test_nn.py::TestNN::test_Flatten, test/test_nn.py::TestNN::test_Flatten_cuda, test/test_nn.py::TestNN::test_Flatten_no_batch_dim, test/test_nn.py::TestNN::test_Flatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Fold, test/test_nn.py::TestNN::test_Fold_cuda, test/test_nn.py::TestNN::test_Fold_int_input, test/test_nn.py::TestNN::test_Fold_int_input_cuda, test/test_nn.py::TestNN::test_Fold_no_batch_dim_input, test/test_nn.py::TestNN::test_Fold_no_batch_dim_input_cuda, test/test_nn.py::TestNN::test_Fold_no_batch_dim_int_input, test/test_nn.py::TestNN::test_Fold_no_batch_dim_int_input_cuda, 
test/test_nn.py::TestNN::test_GELU_no_batch_dim, test/test_nn.py::TestNN::test_GELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_GLU_no_batch_dim, test/test_nn.py::TestNN::test_GLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardshrink_no_batch_dim, test/test_nn.py::TestNN::test_Hardshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardswish_no_batch_dim, test/test_nn.py::TestNN::test_Hardswish_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_HuberLoss_delta, test/test_nn.py::TestNN::test_HuberLoss_delta_cuda, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean, 
test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_batch_mean, test/test_nn.py::TestNN::test_KLDivLoss_batch_mean_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar, 
test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_log_target_cuda, test/test_nn.py::TestNN::test_KLDivLoss_with_log_target_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_with_log_target_no_reduce_cuda, test/test_nn.py::TestNN::test_KLDivLoss_with_target_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_with_target_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_reduce, test/test_nn.py::TestNN::test_L1Loss_no_reduce_complex, test/test_nn.py::TestNN::test_L1Loss_no_reduce_complex_cuda, test/test_nn.py::TestNN::test_L1Loss_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_reduce_scalar, test/test_nn.py::TestNN::test_L1Loss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_LSTM_cell, test/test_nn.py::TestNN::test_LSTM_cell_forward_hidden_size, test/test_nn.py::TestNN::test_LSTM_cell_forward_input_size, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_cuda, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval, 
test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval_cuda, test/test_nn.py::TestNN::test_LeakyReLU_no_batch_dim, test/test_nn.py::TestNN::test_LeakyReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Linear, test/test_nn.py::TestNN::test_Linear_cuda, test/test_nn.py::TestNN::test_Linear_no_batch_dim, test/test_nn.py::TestNN::test_Linear_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Linear_no_bias, test/test_nn.py::TestNN::test_Linear_no_bias_cuda, test/test_nn.py::TestNN::test_LogSigmoid_no_batch_dim, test/test_nn.py::TestNN::test_LogSigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_reduce, test/test_nn.py::TestNN::test_MSELoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none, 
test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MaxUnpool1d_net, test/test_nn.py::TestNN::test_MaxUnpool1d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool1d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool1d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MaxUnpool2d_net, test/test_nn.py::TestNN::test_MaxUnpool2d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool2d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool2d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MaxUnpool3d_net, test/test_nn.py::TestNN::test_MaxUnpool3d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool3d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool3d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Mish_no_batch_dim, test/test_nn.py::TestNN::test_Mish_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ModuleDict, test/test_nn.py::TestNN::test_ModuleList, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_1d_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_index_neg, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_index_neg_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_double, 
test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_reduce, 
test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_weights_no_reduce, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_1d_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_margin_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_weights_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_double, 
test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_reduce, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_neg, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_neg_cuda, test/test_nn.py::TestNN::test_PReLU_backward_requires_grad_false, test/test_nn.py::TestNN::test_PReLU_no_batch_dim, test/test_nn.py::TestNN::test_PReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_PairwiseDistance, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_with_non_default_args, test/test_nn.py::TestNN::test_PairwiseDistance_with_non_default_args_cuda, test/test_nn.py::TestNN::test_ParameterDict, test/test_nn.py::TestNN::test_ParameterDict_replication, test/test_nn.py::TestNN::test_ParameterList, 
test/test_nn.py::TestNN::test_ParameterList_meta, test/test_nn.py::TestNN::test_ParameterList_replication, test/test_nn.py::TestNN::test_PixelShuffle, test/test_nn.py::TestNN::test_PixelShuffle_cuda, test/test_nn.py::TestNN::test_PixelUnshuffle, test/test_nn.py::TestNN::test_PixelUnshuffle_cuda, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_reduce, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_RNN_cell, test/test_nn.py::TestNN::test_RNN_cell_forward_zero_hidden_size, test/test_nn.py::TestNN::test_RNN_cell_no_broadcasting, test/test_nn.py::TestNN::test_RNN_change_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_no_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_with_dropout, test/test_nn.py::TestNN::test_RNN_cudnn_weight_norm, test/test_nn.py::TestNN::test_RNN_dropout, test/test_nn.py::TestNN::test_RNN_dropout_state, test/test_nn.py::TestNN::test_RNN_input_size_zero, test/test_nn.py::TestNN::test_RNN_nonlinearity, test/test_nn.py::TestNN::test_RNN_nonlinearity_passed_as_arg, test/test_nn.py::TestNN::test_RReLU, test/test_nn.py::TestNN::test_RReLU_cuda, 
test/test_nn.py::TestNN::test_RReLU_no_batch_dim, test/test_nn.py::TestNN::test_RReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_RReLU_with_up_down, test/test_nn.py::TestNN::test_RReLU_with_up_down_cuda, test/test_nn.py::TestNN::test_RReLU_with_up_down_scalar, test/test_nn.py::TestNN::test_RReLU_with_up_down_scalar_cuda, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ReLU_no_batch_dim, test/test_nn.py::TestNN::test_ReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d, test/test_nn.py::TestNN::test_ReplicationPad3d_complex, test/test_nn.py::TestNN::test_ReplicationPad3d_complex_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d_no_batch_dim, test/test_nn.py::TestNN::test_ReplicationPad3d_no_batch_dim_cuda, test/test_nn.py::TestNN::test_SELU_no_batch_dim, test/test_nn.py::TestNN::test_SELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Sequential_add, test/test_nn.py::TestNN::test_Sequential_append, test/test_nn.py::TestNN::test_Sequential_delitem, test/test_nn.py::TestNN::test_Sequential_extend, test/test_nn.py::TestNN::test_Sequential_getitem, test/test_nn.py::TestNN::test_Sequential_iadd, test/test_nn.py::TestNN::test_Sequential_imul, test/test_nn.py::TestNN::test_Sequential_insert, test/test_nn.py::TestNN::test_Sequential_insert_fail_case, test/test_nn.py::TestNN::test_Sequential_mul, test/test_nn.py::TestNN::test_Sequential_pop, test/test_nn.py::TestNN::test_Sequential_rmul, test/test_nn.py::TestNN::test_Sequential_setitem, test/test_nn.py::TestNN::test_Sequential_setitem_named, test/test_nn.py::TestNN::test_SiLU_no_batch_dim, test/test_nn.py::TestNN::test_SiLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Sigmoid_no_batch_dim, test/test_nn.py::TestNN::test_Sigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_beta, test/test_nn.py::TestNN::test_SmoothL1Loss_beta_cuda, 
test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_scalar, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_zero_beta, test/test_nn.py::TestNN::test_SmoothL1Loss_zero_beta_cuda, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_float, 
test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_reduce, test/test_nn.py::TestNN::test_SoftMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_Softplus_no_batch_dim, test/test_nn.py::TestNN::test_Softplus_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Softshrink_no_batch_dim, test/test_nn.py::TestNN::test_Softshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Softsign_no_batch_dim, test/test_nn.py::TestNN::test_Softsign_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Tanh_no_batch_dim, test/test_nn.py::TestNN::test_Tanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Tanhshrink_no_batch_dim, test/test_nn.py::TestNN::test_Tanhshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Threshold_no_batch_dim, test/test_nn.py::TestNN::test_Threshold_no_batch_dim_cuda, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation_cuda, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation_cuda, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation_cuda, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation_cuda, test/test_nn.py::TestNN::test_Transformer_cell, test/test_nn.py::TestNN::test_Transformer_multilayer_coder, test/test_nn.py::TestNN::test_Transformer_multilayer_coder_cuda, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none, 
test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Unfold, test/test_nn.py::TestNN::test_Unfold_cuda, test/test_nn.py::TestNN::test_Unfold_int_input, test/test_nn.py::TestNN::test_Unfold_int_input_cuda, test/test_nn.py::TestNN::test_adaptive_log_softmax, test/test_nn.py::TestNN::test_add_module, test/test_nn.py::TestNN::test_add_module_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_affine_grid, test/test_nn.py::TestNN::test_affine_grid_3d, test/test_nn.py::TestNN::test_affine_grid_error_checking, test/test_nn.py::TestNN::test_assignment, test/test_nn.py::TestNN::test_batch_norm_update_stats, test/test_nn.py::TestNN::test_batchnorm_buffer_update_when_stats_are_not_tracked, test/test_nn.py::TestNN::test_batchnorm_cudnn_half, test/test_nn.py::TestNN::test_batchnorm_cudnn_nhwc, test/test_nn.py::TestNN::test_batchnorm_load_state_dict, test/test_nn.py::TestNN::test_batchnorm_nhwc_cpu, test/test_nn.py::TestNN::test_batchnorm_nhwc_cuda, test/test_nn.py::TestNN::test_batchnorm_non_contig_cpu_BatchNorm2d, test/test_nn.py::TestNN::test_batchnorm_non_contig_cpu_SyncBatchNorm, test/test_nn.py::TestNN::test_batchnorm_nonaffine_cuda_half_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_bias_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_less_than_one_value_per_channel, 
test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_mean_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_or_running_mean_have_forward_grad, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_weight_is_not_same_size_as_input, test/test_nn.py::TestNN::test_bce_loss_always_nonnegative, test/test_nn.py::TestNN::test_bce_loss_broadcasts_weights, test/test_nn.py::TestNN::test_bce_loss_input_range, test/test_nn.py::TestNN::test_bce_loss_size_mismatch, test/test_nn.py::TestNN::test_bce_with_logits_broadcasts_pos_weights, test/test_nn.py::TestNN::test_bce_with_logits_broadcasts_weights, test/test_nn.py::TestNN::test_bce_with_logits_gives_same_result_as_sigmoid_and_bce_loss, test/test_nn.py::TestNN::test_bce_with_logits_gives_same_result_as_sigmoid_and_bce_loss_large_tensors_with_grad, test/test_nn.py::TestNN::test_bce_with_logits_has_correct_forward_grad, test/test_nn.py::TestNN::test_bce_with_logits_has_correct_grad_at_zero, test/test_nn.py::TestNN::test_bce_with_logits_ones_in_pos_weights_are_the_same_as_none, test/test_nn.py::TestNN::test_bce_with_logits_raises_if_target_and_input_are_different_size, test/test_nn.py::TestNN::test_bce_with_logits_stability, test/test_nn.py::TestNN::test_bce_with_logits_with_pos_weight_has_correct_grad_at_zero, test/test_nn.py::TestNN::test_bilinear, test/test_nn.py::TestNN::test_bilinear_broadcasting, test/test_nn.py::TestNN::test_bilinear_no_bias, test/test_nn.py::TestNN::test_bilinear_non_contiguous, test/test_nn.py::TestNN::test_broadcast_double_backwards_gpu, test/test_nn.py::TestNN::test_broadcast_no_grad, test/test_nn.py::TestNN::test_broadcast_not_requiring_grad, test/test_nn.py::TestNN::test_buffer_not_persistent, test/test_nn.py::TestNN::test_buffer_not_persistent_assign, test/test_nn.py::TestNN::test_buffer_not_persistent_del, 
test/test_nn.py::TestNN::test_buffer_not_persistent_load, test/test_nn.py::TestNN::test_buffer_not_persistent_overwrite, test/test_nn.py::TestNN::test_buffers_and_named_buffers, test/test_nn.py::TestNN::test_call_supports_python_dict_output, test/test_nn.py::TestNN::test_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_children, test/test_nn.py::TestNN::test_container_copy, test/test_nn.py::TestNN::test_convert_sync_batchnorm, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_diff_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_nonexpandable_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_invalid_shape, test/test_nn.py::TestNN::test_cosine_embedding_loss_margin_no_reduce, test/test_nn.py::TestNN::test_cosine_embedding_loss_no_reduce, test/test_nn.py::TestNN::test_cosine_embedding_loss_with_diff_type, test/test_nn.py::TestNN::test_cosine_similarity, test/test_nn.py::TestNN::test_cross_entropy_loss, test/test_nn.py::TestNN::test_cross_entropy_loss_precision, test/test_nn.py::TestNN::test_cross_entropy_loss_zero_div, test/test_nn.py::TestNN::test_cudnn_forward_exception, test/test_nn.py::TestNN::test_cudnn_rnn_dropout_states_device, test/test_nn.py::TestNN::test_cudnn_weight_format, test/test_nn.py::TestNN::test_cudnn_weight_tying, test/test_nn.py::TestNN::test_dir, test/test_nn.py::TestNN::test_dir_digit, test/test_nn.py::TestNN::test_elu_inplace_gradgrad, test/test_nn.py::TestNN::test_elu_inplace_on_view, test/test_nn.py::TestNN::test_error_RNN_seq_len_zero, test/test_nn.py::TestNN::test_extra_state, test/test_nn.py::TestNN::test_extra_state_missing_get_extra_state, test/test_nn.py::TestNN::test_extra_state_missing_set_extra_state, test/test_nn.py::TestNN::test_extra_state_non_dict, test/test_nn.py::TestNN::test_fb_fc_packed, test/test_nn.py::TestNN::test_flatten, test/test_nn.py::TestNN::test_fold_invalid_arg, test/test_nn.py::TestNN::test_fractional_max_pool2d_invalid_output_ratio, 
test/test_nn.py::TestNN::test_gaussian_nll_loss_args, test/test_nn.py::TestNN::test_gaussian_nll_loss_broadcasting, test/test_nn.py::TestNN::test_get_buffer, test/test_nn.py::TestNN::test_get_buffer_from_submodules, test/test_nn.py::TestNN::test_getattr_with_property, test/test_nn.py::TestNN::test_grid_sample, test/test_nn.py::TestNN::test_grid_sample_3d, test/test_nn.py::TestNN::test_grid_sample_error_checking, test/test_nn.py::TestNN::test_grid_sample_nearest_neighbor_rounding_mode_consistency, test/test_nn.py::TestNN::test_hardtanh_backward, test/test_nn.py::TestNN::test_hardtanh_inplace_gradgrad, test/test_nn.py::TestNN::test_huber_loss_invalid_delta, test/test_nn.py::TestNN::test_inplace_thnn, test/test_nn.py::TestNN::test_interpolate, test/test_nn.py::TestNN::test_interpolate_bicubic_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_shared_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_shared_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_cuda, 
test/test_nn.py::TestNN::test_interpolate_bilinear_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_buffer_overflow, test/test_nn.py::TestNN::test_interpolate_illegal_memory_access, test/test_nn.py::TestNN::test_interpolate_linear_1d, test/test_nn.py::TestNN::test_interpolate_linear_1d_align_corners, test/test_nn.py::TestNN::test_interpolate_linear_1d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_linear_1d_cuda, test/test_nn.py::TestNN::test_interpolate_linear_1d_zero_dim, test/test_nn.py::TestNN::test_interpolate_linear_1d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_align_corners, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_cuda, test/test_nn.py::TestNN::test_interpolate_linear_tuple_1d, test/test_nn.py::TestNN::test_interpolate_linear_tuple_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_1d, 
test/test_nn.py::TestNN::test_interpolate_nearest_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_1d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_1d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d, test/test_nn.py::TestNN::test_interpolate_nearest_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d_launch_configs, test/test_nn.py::TestNN::test_interpolate_nearest_2d_launch_configs_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_3d, test/test_nn.py::TestNN::test_interpolate_nearest_3d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_3d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_3d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_1d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_2d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_3d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_3d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_1d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_2d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_3d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_zero_dim, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_align_corners, 
test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_align_corners, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_cuda, test/test_nn.py::TestNN::test_interpolate_undefined_behavior_casting, test/test_nn.py::TestNN::test_kl_div_log_softmax_target, test/test_nn.py::TestNN::test_kl_div_with_diff_type, test/test_nn.py::TestNN::test_kl_div_with_diff_type_log_target, test/test_nn.py::TestNN::test_l1_loss_correct, test/test_nn.py::TestNN::test_layer_norm_eps, test/test_nn.py::TestNN::test_layer_norm_grads_with_create_graph_flag, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSR, 
test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightStrided, test/test_nn.py::TestNN::test_linear_broadcasting, test/test_nn.py::TestNN::test_linear_raise_on_scalar_input, test/test_nn.py::TestNN::test_log_softmax_dim0, test/test_nn.py::TestNN::test_log_softmax_dim0_cuda, test/test_nn.py::TestNN::test_log_softmax_dim3, test/test_nn.py::TestNN::test_log_softmax_dim3_cuda, test/test_nn.py::TestNN::test_log_softmax_lastdim, test/test_nn.py::TestNN::test_log_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_log_softmax_scalar, test/test_nn.py::TestNN::test_log_softmax_scalar_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial, test/test_nn.py::TestNN::test_log_softmax_spatial_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial_special, test/test_nn.py::TestNN::test_log_softmax_spatial_special_cuda, test/test_nn.py::TestNN::test_loss_equal_input_target_shape, test/test_nn.py::TestNN::test_margin_ranking_loss_margin_no_reduce, test/test_nn.py::TestNN::test_margin_ranking_loss_no_reduce, test/test_nn.py::TestNN::test_max_pool1d_invalid_output_size, test/test_nn.py::TestNN::test_module_apply_inplace_op, test/test_nn.py::TestNN::test_module_backcompat, test/test_nn.py::TestNN::test_module_super_init, test/test_nn.py::TestNN::test_module_to_argparse, test/test_nn.py::TestNN::test_modules, test/test_nn.py::TestNN::test_mse_loss_size_warning, test/test_nn.py::TestNN::test_multimarginloss_1d_input_0d_target_no_reduce, test/test_nn.py::TestNN::test_multimarginloss_1d_input_0d_target_no_reduce_cuda, test/test_nn.py::TestNN::test_named_children, test/test_nn.py::TestNN::test_named_modules, test/test_nn.py::TestNN::test_named_parameters_remove_duplicate, test/test_nn.py::TestNN::test_native_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_nested_tensor_from_mask, test/test_nn.py::TestNN::test_nested_tensor_from_mask_error, test/test_nn.py::TestNN::test_no_grad, test/test_nn.py::TestNN::test_non_leaf_parameters, 
test/test_nn.py::TestNN::test_normalize, test/test_nn.py::TestNN::test_overwrite_module_params_on_conversion, test/test_nn.py::TestNN::test_pack_sequence_batch_sizes_throw, test/test_nn.py::TestNN::test_pad_scalar_error, test/test_nn.py::TestNN::test_padding_list, test/test_nn.py::TestNN::test_pairwise_distance, test/test_nn.py::TestNN::test_parameter_assignment, test/test_nn.py::TestNN::test_parameterlistdict_pickle, test/test_nn.py::TestNN::test_parameterlistdict_setting_attributes, test/test_nn.py::TestNN::test_parameters_and_named_parameters, test/test_nn.py::TestNN::test_parameters_to_vector, test/test_nn.py::TestNN::test_parse_to, test/test_nn.py::TestNN::test_partial_flat_weights, test/test_nn.py::TestNN::test_pdist, test/test_nn.py::TestNN::test_pdist_cpu_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_cuda_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_empty_col, test/test_nn.py::TestNN::test_pdist_empty_row, test/test_nn.py::TestNN::test_pdist_large, test/test_nn.py::TestNN::test_pdist_zeros, test/test_nn.py::TestNN::test_pixel_shuffle_nhwc_cpu, test/test_nn.py::TestNN::test_pixel_shuffle_unshuffle, test/test_nn.py::TestNN::test_pointwise_loss_broadcast, test/test_nn.py::TestNN::test_pointwise_loss_target_grad_none_reduction, test/test_nn.py::TestNN::test_projections_errors_on_gru_and_rnn, test/test_nn.py::TestNN::test_projections_lstm_args_check, test/test_nn.py::TestNN::test_projections_lstm_check_device, test/test_nn.py::TestNN::test_projections_lstm_initial_hidden_state, test/test_nn.py::TestNN::test_register_buffer_allows_overwriting_with_same_name, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_name_is_not_string, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_not_tensor, test/test_nn.py::TestNN::test_register_parameter_allows_overwriting_with_same_name, 
test/test_nn.py::TestNN::test_register_parameter_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_register_parameter_raises_error_if_name_is_not_string, test/test_nn.py::TestNN::test_register_state_dict_pre_hook, test/test_nn.py::TestNN::test_register_state_dict_pre_hook_lazy_module, test/test_nn.py::TestNN::test_relu_inplace_on_view, test/test_nn.py::TestNN::test_repr, test/test_nn.py::TestNN::test_requires_grad_, test/test_nn.py::TestNN::test_rnn_args_check, test/test_nn.py::TestNN::test_rnn_check_device, test/test_nn.py::TestNN::test_rnn_initial_hidden_state, test/test_nn.py::TestNN::test_rnn_weight_norm, test/test_nn.py::TestNN::test_share_memory, test/test_nn.py::TestNN::test_smoothl1loss_intergral_target, test/test_nn.py::TestNN::test_smoothl1loss_negative_beta_not_supported, test/test_nn.py::TestNN::test_softmax_functional_dim0, test/test_nn.py::TestNN::test_softmax_functional_dim0_cuda, test/test_nn.py::TestNN::test_softmax_functional_dim3, test/test_nn.py::TestNN::test_softmax_functional_dim3_cuda, test/test_nn.py::TestNN::test_softmax_functional_scalar, test/test_nn.py::TestNN::test_softmax_functional_scalar_cuda, test/test_nn.py::TestNN::test_softmax_lastdim, test/test_nn.py::TestNN::test_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_softmax_lastdim_dtype, test/test_nn.py::TestNN::test_softmax_lastdim_dtype_cuda, test/test_nn.py::TestNN::test_softmax_spatial, test/test_nn.py::TestNN::test_softmax_spatial_cuda, test/test_nn.py::TestNN::test_softmax_spatial_dtype, test/test_nn.py::TestNN::test_softmax_spatial_dtype_cuda, test/test_nn.py::TestNN::test_softmax_spatial_special, test/test_nn.py::TestNN::test_softmax_spatial_special_cuda, test/test_nn.py::TestNN::test_softmin, test/test_nn.py::TestNN::test_spectral_norm, test/test_nn.py::TestNN::test_spectral_norm_dim, test/test_nn.py::TestNN::test_spectral_norm_forward, test/test_nn.py::TestNN::test_spectral_norm_load_state_dict, test/test_nn.py::TestNN::test_spectral_norm_pickle, 
test/test_nn.py::TestNN::test_state_dict, test/test_nn.py::TestNN::test_swap_module_params_fails_after_forward, test/test_nn.py::TestNN::test_sync_batchnorm_accuracy_cuda, test/test_nn.py::TestNN::test_sync_batchnorm_backward_elemt, test/test_nn.py::TestNN::test_threshold_bfloat16_half, test/test_nn.py::TestNN::test_threshold_int, test/test_nn.py::TestNN::test_to, test/test_nn.py::TestNN::test_train_errors_for_invalid_mode, test/test_nn.py::TestNN::test_transformer_args_check, test/test_nn.py::TestNN::test_transformer_layer_args_check, test/test_nn.py::TestNN::test_transformerdecoder, test/test_nn.py::TestNN::test_transformerdecoderlayer, test/test_nn.py::TestNN::test_transformerdecoderlayer_gelu, test/test_nn.py::TestNN::test_triplet_margin_loss, test/test_nn.py::TestNN::test_triplet_margin_loss_no_reduce, test/test_nn.py::TestNN::test_triplet_margin_loss_swap, test/test_nn.py::TestNN::test_triplet_margin_loss_swap_no_reduce, test/test_nn.py::TestNN::test_type, test/test_nn.py::TestNN::test_unflatten, test/test_nn.py::TestNN::test_unflatten_invalid_arg, test/test_nn.py::TestNN::test_unfold_invalid_arg, test/test_nn.py::TestNN::test_upsamplingBilinear2d_spatial_invariance, test/test_nn.py::TestNN::test_upsamplingLinear1d, test/test_nn.py::TestNN::test_upsamplingLinear1d_spatial_invariance, test/test_nn.py::TestNN::test_upsamplingTrilinear3d_spatial_invariance, test/test_nn.py::TestNN::test_upsampling_bfloat16, test/test_nn.py::TestNN::test_upsampling_not_recompute_scale_factor, test/test_nn.py::TestNN::test_upsampling_small_scale, test/test_nn.py::TestNN::test_vector_to_parameters, test/test_nn.py::TestNN::test_weight_norm, test/test_nn.py::TestNN::test_weight_norm_pickle, test/test_nn.py::TestNN::test_zero_grad, test/test_nn.py::TestFusionEval::test_fuse_module_eval_numerics, test/test_nn.py::TestConstantPadNd::test_constant_pad_nd, test/test_nn.py::TestConstantPadNd::test_preserves_memory_format, test/test_nn.py::TestAddRelu::test_add_relu, 
test/test_nn.py::TestAddRelu::test_add_relu_broadcasting, test/test_nn.py::TestFunctionalPickle::test_pickle_softsign, test/test_nn.py::TestFusionUtils::test_fuse_conv_bn_requires_grad, test/test_nn.py::TestFusionUtils::test_fuse_linear_bn_requires_grad, test/test_nn.py::TestUtils::test_consume_prefix_in_state_dict_if_present, test/test_nn.py::TestNNDeviceTypeCUDA::test_BatchNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Bilinear_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_cudnn_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_empty_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_mean_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_mean_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_none_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_none_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_sum_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_sum_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GRU_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_memory_format_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_numeric_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_raises_error_if_one_value_per_group_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm1d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm2d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm3d_general_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_differentiable_backward_using_oneDNN_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_differentiable_backward_using_oneDNN_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_LayerNorm_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LayerNorm_numeric_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LocalResponseNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_warnings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad2d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad3d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_empty_cuda_complex64, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad1d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad2d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad3d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad_empty_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoderLayer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoder_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerEncoderLayer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerEncoder_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Transformer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Unfold_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_bfloat16, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_adaptiveavg_pool1d_shmem_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate45_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate90_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotateRandom_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_3d_rotateRandom_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_avg_pool_large_tensor_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_update_stats_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_channel_shuffle_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_error_if_nonfinite_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_0_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_1_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_4_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_inf_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_0_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_1_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_4_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_inf_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_sum_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_consistent_index_target_and_probs_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_errors_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_weight_ignore_indices_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_with_probs_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_index_target_unit_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_one_hot_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_all_reductions_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_none_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_none_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_sum_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_sum_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_unit_weights_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_device_mask_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_elu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_elu_inplace_with_neg_alpha_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_fold_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_glu_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_bfloat16_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_half_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_2d_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_2d_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_3d_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_3d_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_nan_inf_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_nan_inf_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardsigmoid_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_inplace_overlap_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_for_single_spatial_element_during_training_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_True_affine_False_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_less_than_one_value_per_channel_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_invalid_reduction_strings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_layernorm_half_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_layernorm_weight_bias_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_with_neg_slope_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_with_zero_slope_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_linear_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_logsigmoid_out_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_lstmcell_backward_only_one_output_grad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_TxT_layout_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_devices_parity_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_forward_with_nans_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_mask_types_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_transformer_layout_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_mish_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_non_recursive_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_all_ignored_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_byte_target_matches_long_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_invalid_target_dim_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_invalid_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_mismatched_batch_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_out_of_bounds_ignore_index_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_total_weight_is_zero_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_scalars_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_scalars_reductions_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nonlinearity_propagate_nan_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_one_hot_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_overwrite_module_params_on_conversion_cpu_device_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_pad_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_pad_cuda_float64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_prelu_backward_32bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_replicatepad_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_fused_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_fused_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_save_lstm_compatibility_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_silu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_skip_init_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smooth_l1_loss_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smooth_l1_loss_vs_huber_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smoothl1loss_backward_zero_beta_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_backward_64bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_forward_64bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_results_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_results_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softplus_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softplus_low_threshold_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_inplace_overlap_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_negative_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_threshold_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_to_complex_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_fast_path_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_gelu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_gelu_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_default_parity_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_True_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_True_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_False_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_False_input_size_403_output_size_377_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_True_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_True_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bilinear_memory_format1_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int16_cuda_int16, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float64_cuda_float64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_float32_cuda_float32, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int8_cuda_int8, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int64_cuda_int64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int32_cuda_int32, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int16_cuda_int16, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_correctness_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBilinear2d_aa_correctness_memory_format0_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBilinear2d_aa_correctness_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_fail_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_rocm_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format1_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format0_isize_20_osize_11_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format0_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format1_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_correctness_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_correctness_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_rescale_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_20_osize_11_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_False_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_False_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsampling_64bit_indexing_channels_last_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_warp_softmax_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_warp_softmax_64bit_indexing_cuda_float32 2024-04-03T12:10:56.8807594Z 2024-04-03T12:10:56.8808249Z Running test_ops 1/10 ... [2024-04-03 12:10:56.512662] 2024-04-03T12:10:56.8809031Z Running test_ops 2/10 ... [2024-04-03 12:10:56.513068] 2024-04-03T12:10:56.8811644Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=2', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:10:56.513758] 2024-04-03T12:10:56.8814662Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=1', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:10:56.513855] 2024-04-03T12:10:56.8816441Z Running test_ops 4/10 ... [2024-04-03 12:10:56.513767] 2024-04-03T12:10:56.8817167Z Running test_ops 3/10 ... 
[2024-04-03 12:10:56.513761] 2024-04-03T12:10:56.8819093Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=4', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:10:56.515214] 2024-04-03T12:10:56.8822110Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=3', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:10:56.515322] 2024-04-03T12:22:46.5083385Z 2024-04-03T12:22:46.5085901Z test_ops 2/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_2.10_dbac769edb38711e_.log 2024-04-03T12:22:46.6998503Z Running 3052 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nextafter_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logspace_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_permuted_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_item_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_lerp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv_transpose3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_randn_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sigmoid_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_squeeze_multiple_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rxor___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_addcdiv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_column_stack_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_digamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_meshgrid_list_of_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hardshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_mse_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_selu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_positive_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_var_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addbmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addcdiv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addmm_decomposed_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cfloat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cholesky_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_conj_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_corrcoef_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_double_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_exp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expand_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_flip_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_float_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_gather_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_histogramdd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_imag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_2inputs_2outputs_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_le_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cond_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigvals_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_inv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lstsq_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linspace_tensor_overload_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_log10_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nanmedian_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nansum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_neg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_batch_norm_without_cudnn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_elu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_logsigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_mse_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_head_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_constant_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_replicate_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_scaled_dot_product_attention_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_upsample_nearest_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_permute_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_positive_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_put_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_blackman_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_slice_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sparse_mm_reduce_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_airy_ai_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_scaled_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_squeeze_multiple_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch__scaled_mm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trapz_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unique_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_errors_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_errors_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ge_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_bartlett_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_uniform_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_cuda, test/test_ops.py::TestCommonCUDA::test_errors_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices_H_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cov_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erfinv_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_2inputs_2outputs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_kron_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_norm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_msort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cosine_embedding_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multi_margin_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_interleave_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize__cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_searchsorted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i1_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trapezoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_amin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_3d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_erfinv_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_linalg_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_maximum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_scatter_reduce_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_k0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___radd___cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_angle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_corrcoef_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nansum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_linear_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rand_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_entr_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_legendre_polynomial_p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_argwhere_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_gaussian_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_multiple_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_where_cuda_float64, test/test_ops.py::TestCommonCUDA::test_out_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logspace_tensor_overload_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_norm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mT_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_elu_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_digamma_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_gather_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_det_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_normal_number_mean_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_xlog1py_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning_H_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___getitem___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_acos_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addcdiv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_conj_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_frac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_i0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_not_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_roll_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__segment_reduce_offsets_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cov_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_double_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_eq_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_binary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_binary_return_by_ref_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_cholesky_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log10_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_select_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_var_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv_transpose1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_dropout2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool3d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_rms_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_triplet_margin_with_distance_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_inf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_amax_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_slice_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sort_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sparse_sampled_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_laguerre_polynomial_l_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_square_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unravel_index_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_zeros_like_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_left_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mish_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs__conversions_polar_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_roll_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_trace_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_igammac_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_indices_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_indices_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_indices_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_polar_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rpow___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcdiv_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_einsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_frac_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_inner_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eig_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_long_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_fro_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_outer_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pinverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_airy_ai_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapz_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_like_cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mvlgamma_mvlgamma_p_3_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_add_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsafe_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cummax_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_int_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_kron_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_softmax_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_renorm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_trunc_rounding_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_eye_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_local_response_norm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rad2deg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_j1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_y0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_v_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsafe_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_expand_as_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_igamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool1d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_searchsorted_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_y0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_div_floor_rounding_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lgamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_meshgrid_variadic_tensors_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signbit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_k0_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_mean_unbiased_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view_T_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rmul___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_acos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addcmul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_all_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diagonal_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_eq_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_flip_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_select_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isnan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_cross_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_not_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_prod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sqrt_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_square_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_squeeze_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tril_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_column_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cross_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_flip_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_item_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_binary_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ldexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_pinv_singular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_prod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_narrow_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv1d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_pow_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_to_sparse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_var_unbiased_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_vsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_vstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rmatmul___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rpow___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_half_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_conj_physical_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isfinite_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_lerp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_not_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_mul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_softmin_with_dtype_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_positive_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_repeat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_special_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_stft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tensor_split_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_true_divide_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_vdot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_allclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_any_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cdouble_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_clone_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumprod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diag_embed_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_permuted_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_eq_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfftn_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fliplr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_hsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_factor_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_power_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_svdvals_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_xor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_unpack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_constant_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_silu_complex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_pow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_renorm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_square_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tile_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsqueeze_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view_H_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_T_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___radd___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rsub___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_T_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_byte_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_all_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_dstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expand_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_flatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fliplr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isinf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nan_to_num_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nextafter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_elu_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_bessel_j1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_erfcx_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_stft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_zeros_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addbmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_any_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atan2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bool_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_broadcast_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cauchy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cdouble_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_conj_physical_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cummax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_like_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_expand_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_expm1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_exponential_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_rfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_gather_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_gt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_histc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_igammac_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_householder_product_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_power_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_softmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_long_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lu_unpack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_cumprod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_msort_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_nan_to_num_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nanquantile_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nansum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_zeros_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardswish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pinverse_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_randint_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ravel_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_reciprocal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scalar_tensor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_cosine_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_general_hamming_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_hermite_polynomial_h_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tensor_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trapz_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_triu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unfold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unique_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_var_mean_unbiased_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rxor___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_chunk_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_empty_permuted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_gcd_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_histc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_logsigmoid_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atanh_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_flip_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_solve_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_glu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_chalf_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_native_batch_norm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_slice_scatter_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_eye_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_imag_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_tensorsolve_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_narrow_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_margin_ranking_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signbit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_airy_ai_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_true_divide_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_unique_consecutive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_or_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isposinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_unary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lcm_cuda_int64, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ldexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_det_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_smooth_l1_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scalar_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_airy_ai_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float64, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float16, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_int_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_long_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_acosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_arange_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_asin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atan2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_block_diag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_column_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_deg2rad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_digamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_erf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_exponential_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_float_power_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ge_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_item_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logical_xor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_narrow_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_normal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_renorm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_roll_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_entr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_i1e_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_triu_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_unflatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_var_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_vdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atan2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_left_shift_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_not_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_cartesian_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagflat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagonal_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_div_trunc_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_exp_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_expand_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fliplr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_floor_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_geqrf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isnan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isneginf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_not_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_or_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lu_unpack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_var_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_matmul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_median_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mode_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_movedim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_native_dropout_backward_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_empty_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nonzero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ormqr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randn_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_real_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_repeat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_exponential_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signbit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_xlogy_cuda_float32
2024-04-03T12:22:46.8781324Z
2024-04-03T12:22:46.8781928Z Running test_ops 5/10 ... [2024-04-03 12:22:46.516777]
2024-04-03T12:22:46.8783942Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=5', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:22:46.517373]
2024-04-03T12:23:03.1985356Z
2024-04-03T12:23:03.1990985Z test_ops 4/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_4.10_dec7da1c824451ee_.log
2024-04-03T12:23:03.4110663Z Running 3197 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fft_ifftshift_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_triangular_solve_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cfloat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_put_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_log_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_narrow_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nonzero_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_reshape_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_scalar_tensor_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_slice_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unfold_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsqueeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rmatmul___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_bool_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_byte_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_cfloat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_char_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_int_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_long_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_any_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_deg2rad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_exp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_exp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ge_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_item_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log10_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_l1_loss_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_normal__in_place_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_round_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_trace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_triu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__softmax_backward_data_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__upsample_bilinear2d_aa_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_clamp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_einsum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_eq_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_exp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ge_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_half_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_4inputs_with_extra_args_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_ldl_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_pinv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_pinv_hermitian_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_log1p_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mH_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_median_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mode_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mvlgamma_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_feature_alpha_dropout_with_train_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_one_hot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_replicate_negative_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_relu6_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_selu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ormqr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_pow_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_repeat_interleave_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_reshape_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_round_decimals_0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_round_decimals_neg_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sgn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_modified_bessel_i0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_spherical_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_std_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trapezoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors___ror___cuda, test/test_ops.py::TestCommonCUDA::test_errors_add_cuda, test/test_ops.py::TestCommonCUDA::test_errors_aminmax_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diff_cuda, test/test_ops.py::TestCommonCUDA::test_errors_eye_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_errors_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_errors_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_errors_jiterator_binary_return_by_ref_cuda, test/test_ops.py::TestCommonCUDA::test_errors_kthvalue_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_errors_log_normal_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_lt_cuda, test/test_ops.py::TestCommonCUDA::test_errors_multinomial_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_conv2d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_blackman_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_hermite_polynomial_he_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_errors_triu_cuda, test/test_ops.py::TestCommonCUDA::test_errors_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_errors_where_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices___getitem___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___ror___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rpow___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argsort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_scatter_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cartesian_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fliplr_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_geqrf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ldexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mT_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_mean_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_median_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_reduction_no_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_feature_alpha_dropout_with_train_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_constant_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_outer_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_laguerre_polynomial_l_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_w_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zero__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zero__cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___ror___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_any_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cartesian_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diag_embed_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_erfc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ihfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_rfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_put_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ldexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mH_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mT_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_max_reduction_with_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_softsign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_list_args_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_with_sizes_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_square_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_take_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unique_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___radd___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmatmul___cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmul___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmul___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rsub___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rsub___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_combinations_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_permuted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gather_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gather_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_grad_oriented_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vander_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mT_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardtanh_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_rms_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_nuc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_with_dtype_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_h_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_sparse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_sparse_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_consecutive_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_numpy_ref_addbmm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_argwhere_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_bartlett_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tensor_split_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_where_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_out___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_ceil_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_to_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lcm_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_out_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acosh_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_householder_product_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_softmax_with_dtype_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestCommonCUDA::test_out_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_asin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_asinh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_not_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cos_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_diag_embed_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_exp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log1p_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_mse_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_pow_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_multigammaln_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unflatten_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__upsample_bilinear2d_aa_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_abs_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addcdiv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_angle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_any_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_argwhere_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bincount_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_contiguous_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diff_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_float_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_histc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_4inputs_with_extra_args_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_ldl_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_rank_hermitian_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_qr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_tensorsolve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mH_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_cumsum_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_masked_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nan_to_num_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nanmean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_native_dropout_backward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_gaussian_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_instance_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_kl_div_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_multi_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_multilabel_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softsign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_upsample_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_put_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_qr_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_randn_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_real_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_select_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_airy_ai_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_hermite_polynomial_he_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_to_sparse_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_var_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_zero__cuda, test/test_ops.py::TestCommonCUDA::test_out_zero__cuda_float32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cauchy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_gelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_huber_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_huber_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softplus_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs__conversions_complex_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_amin_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_item_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_ne_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_tril_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_view_as_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hinge_embedding_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lcm_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lcm_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_glu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rdiv___cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cov_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_power_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_full_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_geqrf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isinf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_unary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_grad_oriented_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorinv_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log10_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logcumsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_unpack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_avg_pool1d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_inf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_renorm_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unflatten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_uniform_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zero__cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_flip_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lgamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_slogdet_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_select_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_square_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addmm_decomposed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_broadcast_shapes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_erfc_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_geqrf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isnan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_pool2d_with_indices_backward_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nextafter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ones_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rand_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_reshape_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_hamming_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_slice_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_square_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zeros_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_shapes_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_eq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_equal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_vecdot_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_normal_number_mean_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resolve_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_square_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unique_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_where_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_aminmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_any_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lgamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linspace_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_positive_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rad2deg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_j1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_square_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifftn_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_geqrf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isnan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_det_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv1d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_upsample_nearest_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_reshape_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_searchsorted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unsafe_chunk_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view__chunk_cat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_cfloat_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_int_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_contiguous_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diag_embed_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_expm1_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_flatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_istft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_vecdot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_mul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_ne_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_ones_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_randn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_std_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tensor_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_var_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_as_strided_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_combinations_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_contiguous_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagflat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_empty_permuted_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_select_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_factor_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_slogdet_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log10_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mT_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_neg_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv3d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_softsign_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_positive_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reshape_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_short_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tensordot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_trapezoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unsafe_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rdiv___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_char_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_hfft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_ifftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_square_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addmv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_asin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_bfloat16_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_block_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_column_stack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_constant_pad_nd_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cosh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cov_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diagonal_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_double_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_exp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_copy_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lerp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_det_singular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_eigh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_inv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_factor_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_multi_dot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_svd_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_select_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nansum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_neg_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_full_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ormqr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_reshape_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_squeeze_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_take_along_dim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_triangular_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_vsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_vstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_cfloat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_amax_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atleast_3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_conj_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_conj_physical_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_count_nonzero_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_digamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_floor_rounding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_irfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_frac_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ge_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_hsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_item_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_not_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_xor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_zeros_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_poisson_nll_loss_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_selu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_threshold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rsqrt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sinh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_zeta_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_var_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cfloat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_chalf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cholesky_inverse_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_div_trunc_rounding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_permuted_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_hfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_inner_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isneginf_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cholesky_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cond_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_inv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_qr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_vector_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_cumsum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_median_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_narrow_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_neg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_ctc_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_dropout3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_linear_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_pool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_rms_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_selu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_randn_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resolve_neg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_roll_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_round_decimals_neg_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rsqrt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_blackman_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sparse_sampled_addmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_y0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_legendre_polynomial_p_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sum_to_size_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_svd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_to_sparse_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_triangular_solve_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_uniform_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_var_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_var_unbiased_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_zeros_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_allclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_any_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_deg2rad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_flip_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_flipud_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_floor_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_hsplit_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_multinomial_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nanmean_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_narrow_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_outer_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randint_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_y0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_i1_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_bool_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_broadcast_shapes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_3d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_kron_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_det_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mvlgamma_mvlgamma_p_3_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nanmean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_pow_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_entr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tensordot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cat_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_kron_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_min_reduction_with_dim_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_xlogy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cumulative_trapezoid_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_igamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isfinite_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isreal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_det_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_or_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_feature_alpha_dropout_without_train_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ones_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_blackman_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_any_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_baddbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_and_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_igamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_xor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_meshgrid_variadic_tensors_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rand_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_short_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signbit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_triu_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___getitem___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___rand___cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags___rmul___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_polar_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_as_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cauchy_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_conj_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_empty_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_exp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_fftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_le_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log1p_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_masked_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_mul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_remainder_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_rot90_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_spherical_bessel_j0_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_tensor_split_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tril_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_true_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unsqueeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_vsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__softmax_backward_data_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_angle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_asinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_or_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_cdouble_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_column_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_empty_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_irfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_flip_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_flipud_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_gt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_histc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_hstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_hypot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isposinf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lerp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cond_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_det_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_inv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lu_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_slogdet_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_lt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lu_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_msort_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nanmedian_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_glu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_inf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_number_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_outer_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_remainder_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_renorm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_select_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sort_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_split_with_sizes_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sqrt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tile_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestTagsCUDA::test_tags_transpose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tril_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_triu_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_unflatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unfold_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_as_complex_cuda_float32
2024-04-03T12:23:03.6049398Z
2024-04-03T12:23:03.6050080Z Running test_ops 6/10 ... [2024-04-03 12:23:03.208343]
2024-04-03T12:23:03.6053135Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=6', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:23:03.209074]
2024-04-03T12:23:38.7537222Z
2024-04-03T12:23:38.7542919Z test_ops 3/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_3.10_c151b39a64edb912_.log
2024-04-03T12:23:38.9912352Z Running 3236 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_div_no_rounding_mode_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_combinations_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_gather_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_softmax_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_T_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___getitem___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___rmul___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__native_batch_norm_legit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_half_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cos_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_gt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_istft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_le_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_mul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_logit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addmv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_arange_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atan_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atleast_1d_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_bool_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_char_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_clone_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cross_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cummax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_kron_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_det_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigvalsh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_factor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_tensorinv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logcumsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_median_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mm_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_native_dropout_backward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_cosine_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_dropout2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ones_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_searchsorted_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_bartlett_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_general_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sqrt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_triangular_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_uniform_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_zeros_like_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rdiv___cuda, test/test_ops.py::TestCommonCUDA::test_errors_amin_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bernoulli_cuda, test/test_ops.py::TestCommonCUDA::test_errors_complex_cuda, test/test_ops.py::TestCommonCUDA::test_errors_copysign_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_diag_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_eq_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gather_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gradient_cuda, test/test_ops.py::TestCommonCUDA::test_errors_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_errors_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_errors_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_errors_mul_cuda, test/test_ops.py::TestCommonCUDA::test_errors_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_errors_pow_cuda, test/test_ops.py::TestCommonCUDA::test_errors_roll_cuda, test/test_ops.py::TestCommonCUDA::test_errors_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_errors_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_errors_tril_cuda, test/test_ops.py::TestCommonCUDA::test_errors_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rdiv___cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_angle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_corrcoef_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_permuted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lt_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_reduction_no_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_maximum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nansum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_replicate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_soft_margin_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_airy_ai_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trapz_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_xlogy_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cdouble_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_count_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diff_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_double_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_hfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ihfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_float_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_hstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_resize__cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_resolve_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sqrt_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_decomposed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_permuted_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lgamma_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cond_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_unpack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumsum_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_area_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softsign_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_searchsorted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_airy_ai_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_he_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_list_args_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_list_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_square_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_aminmax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_argwhere_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diff_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorinv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pdist_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_general_cosine_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_general_hamming_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_narrow_copy_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_hypot_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_circular_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ones_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmm_decomposed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_angle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_inverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fftn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ldexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_det_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_matmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_avg_pool2d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_entr_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___rdiv___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_short_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_diagonal_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_eq_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ge_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_empty_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_elu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_relu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_threshold_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_randn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_real_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sqrt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tensor_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_trace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_triu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addmv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_aminmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_asin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_baddbmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cdouble_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cholesky_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cross_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cummax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_deg2rad_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_digamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_gather_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_imag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_istft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_item_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_kthvalue_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_ldl_factor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv_transpose2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_cosine_similarity_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_elu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_grid_sample_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardswish_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_local_response_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_multi_head_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ones_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_permute_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randint_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rsqrt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_short_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_gaussian_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_i1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_i0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_i1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_triu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_uniform_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_pointwise_tag_coverage_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_imag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nextafter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_group_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mish_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_prelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ifftn_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_mul_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_neg_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_igamma_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_imag_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_imag_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hinge_embedding_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_selu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_complex_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frac_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rsub___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dist_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ldexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vecdot_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mT_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_max_pool3d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_static_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_nuc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_list_args_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stack_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsafe_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_dot_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_det_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_matmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_cosine_embedding_loss_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_reshape_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_scatter_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_full_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_argmin_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rsub_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signbit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unsafe_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addbmm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isnan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log10_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_rrelu_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_allclose_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_equal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_gt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isneginf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_vander_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_narrow_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_pow_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_zeros_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_broadcast_shapes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_exp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_expm1_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_eye_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_full_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nansum_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_bessel_y0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_erfcx_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_true_divide_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view_H_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atleast_2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diagonal_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_ifft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_ifftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_float_power_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_masked_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_softmin_with_dtype_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_ones_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_real_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_repeat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_reshape_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_reshape_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_trace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unbind_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_abs_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addbmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_atleast_2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_chunk_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_empty_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_irfft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fliplr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_half_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_put_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isinf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isreal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eigvals_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_inv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lstsq_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lstsq_grad_oriented_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_rank_hermitian_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_tensorinv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logical_and_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logical_not_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logical_or_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_select_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_rms_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_unfold_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_inf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ormqr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_prod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_rand_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_renorm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_squeeze_multiple_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tile_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_true_divide_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_unfold_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_var_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_as_real_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___getitem___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rsub___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_addcdiv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_block_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_constant_pad_nd_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_dot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_dsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_expand_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_hfft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fliplr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_hstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_fill_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_renorm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_roll_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_transpose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unflatten_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unfold_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_view_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_acos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_argwhere_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_contiguous_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_corrcoef_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diagonal_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_flatten_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_float_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_full_like_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_select_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_inner_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isreal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ldexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_diagonal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_eigvalsh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_vander_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_normalize_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_std_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_matrix_exp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_normalize_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_rms_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nonzero_static_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_real_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resize__cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rot90_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rsub_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sub_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unfold_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unfold_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsafe_split_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view__chunk_cat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_bfloat16_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_bool_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_chalf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_double_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_float_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_block_diag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expand_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_hfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_irfft_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_geometric_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_gt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isposinf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_movedim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_native_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_full_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sgn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_xlog1py_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unbind_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unfold_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addmv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_argmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atleast_2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atleast_3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bfloat16_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_chunk_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_clone_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_corrcoef_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cross_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cummin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cumulative_trapezoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_deg2rad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagonal_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_rfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_frexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_geometric_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_2inputs_2outputs_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_le_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eig_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_vecdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linspace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logaddexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_and_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_xor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mH_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_fill_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_logaddexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_matmul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_max_reduction_no_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_min_reduction_with_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_batch_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_fractional_max_pool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardsigmoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_area_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_local_response_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_mish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_prelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_threshold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nonzero_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_qr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rand_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_hann_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_kaiser_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_sinh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_u_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_split_list_args_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_std_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_std_unbiased_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_topk_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_true_divide_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unbind_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unsafe_chunk_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unsqueeze_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_vstack_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_all_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_any_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_empty_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expm1_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_geqrf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_igammac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isreal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_or_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_narrow_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_glu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resize_as__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sinh_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_zeta_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_squeeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_triu_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_as_real_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_and_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cfloat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_combinations_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_copysign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ldexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_prod_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_squeeze_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_baddbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_deg2rad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_flip_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fmax_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mvlgamma_mvlgamma_p_3_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tril_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_full_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ge_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_half_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isneginf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_normalize_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_multinomial_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_outer_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_gaussian_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_y0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_squeeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argwhere_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bincount_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_left_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bool_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_copysign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_digamma_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_gradient_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_igammac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_not_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mvlgamma_mvlgamma_p_3_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randint_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_as_real_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int32, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_uint8, test/test_ops.py::TestTagsCUDA::test_tags___rmatmul___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_byte_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_float_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atleast_1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_not_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_count_nonzero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_flatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_flipud_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_geometric_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_hypot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_i0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_imag_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags__refs_isclose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_lcm_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_lgamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_cross_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log10_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logical_and_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_movedim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sqrt_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_square_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_stft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_take_along_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_trace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_transpose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_partial_views_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_xor_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_bool_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_broadcast_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cauchy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_clamp_max_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_clone_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_constant_pad_nd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cov_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_empty_permuted_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expand_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_eye_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_hfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_irfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fmod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_i0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_imag_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags_index_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_int_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isin_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_istft_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lcm_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_linalg_qr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_vander_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_vecdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log1p_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logdet_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logsumexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_log_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_normalize_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_maximum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nansum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_native_batch_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multi_head_attention_forward_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_positive_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_qr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randint_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randn_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ravel_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resolve_conj_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rot90_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rsub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_i0e_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_i1e_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_zeta_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_split_list_args_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_svd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tensor_split_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unravel_index_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_zeros_cuda_float32 2024-04-03T12:23:39.1797150Z 2024-04-03T12:23:39.1797880Z Running test_ops 7/10 ... 
[2024-04-03 12:23:38.763256] 2024-04-03T12:23:39.1800260Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=7', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:23:38.764046] 2024-04-03T12:23:48.7684458Z 2024-04-03T12:23:48.7687548Z test_ops 1/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_1.10_32c20bd2f2b0e279_.log 2024-04-03T12:23:48.9693926Z Running 3309 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___rxor___cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_tril_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool1d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_complex_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing___getitem___cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_abs_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_bfloat16_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_double_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nanmean_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___radd___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___rsub___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_double_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_float_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_acos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_addr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_floor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_neg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_rsqrt_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_multiple_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_acosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cdist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cholesky_inverse_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cumulative_trapezoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_digamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_equal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_frac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cross_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eig_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lstsq_grad_oriented_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_svdvals_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logical_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_long_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mvlgamma_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_binary_cross_entropy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_fractional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_gaussian_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_bicubic_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_silu_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_inf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_quantile_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rad2deg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_real_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_roll_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scalar_tensor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_sum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_square_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sub_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_zero__cuda, test/test_ops.py::TestCommonCUDA::test_errors_amax_cuda, test/test_ops.py::TestCommonCUDA::test_errors_cov_cuda, test/test_ops.py::TestCommonCUDA::test_errors_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_errors_dot_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ldexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_multi_margin_loss_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_rms_norm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout4_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout4_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sub_cuda, test/test_ops.py::TestCommonCUDA::test_errors_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_2d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bucketize_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_combinations_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gradient_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_heaviside_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_det_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_ctc_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_fro_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_h_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_he_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_std_mean_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_consecutive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_argwhere_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_combinations_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fft2_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_gather_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_heaviside_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_max_binary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_binary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_reduction_no_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_reduction_with_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_outer_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_put_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_repeat_interleave_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_y1_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_view_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_zero__cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmod___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bucketize_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_inverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_unary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vander_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matrix_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nansum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose3d_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_circular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sgn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_entr_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_laguerre_polynomial_l_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_numpy_ref_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_4inputs_with_extra_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_cross_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_searchsorted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isreal_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cdist_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_combinations_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_geqrf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logit_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bmm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logaddexp2_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logcumsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_masked_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_fro_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinh_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_var_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___rmatmul___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___ror___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__batch_norm_with_update_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__native_batch_norm_legit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_cdouble_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_abs_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_arange_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_deg2rad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_exp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_neg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_leaky_relu_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_entr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_stft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unfold_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__segment_reduce_lengths_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addmm_decomposed_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_asinh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_combinations_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_exp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_expand_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_geqrf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_grid_sampler_2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_histogramdd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_unary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigvals_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigvalsh_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_inv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_rank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_tensorinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_vander_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logdet_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_pool2d_with_indices_backward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_native_batch_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_dropout3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_fractional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resize_as__cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scalar_tensor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_searchsorted_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_exponential_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_slice_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_y0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_y1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_log_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_scaled_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_with_sizes_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_take_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_topk_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_polar_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_istft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_group_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softplus_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___getitem___cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___radd___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_corrcoef_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftshift_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_binary_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_householder_product_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool2d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_interleave_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinh_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_bernoulli_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_exp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mode_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rsub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsafe_split_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_einsum_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_equal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_eye_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lgamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_and_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_tanhshrink_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_conj_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_permuted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_full_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_det_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_mean_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_matmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softmin_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randint_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_float_power_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_full_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nextafter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pdist_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rand_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_triu_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_igamma_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vecdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mT_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_rms_norm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unflatten_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_bfloat16_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_char_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_column_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_conj_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_fft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_hsplit_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_item_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_or_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_reciprocal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_stft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_true_divide_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addmv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_any_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_broadcast_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cfloat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_chalf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_constant_pad_nd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cumulative_trapezoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_hsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_2inputs_2outputs_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_kron_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_lerp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eig_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_pinverse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_put_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resize_as__cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resolve_conj_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tensor_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_T_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_chalf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_short_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_abs_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_contiguous_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_dstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_eq_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_expand_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fft_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_select_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isinf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_full_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sinc_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_trace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unbind_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_acosh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_angle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_einsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_kron_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_factor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_solve_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_rank_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logcumsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nanmean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_unfold_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_normal_in_place_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_randn_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scatter_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_list_args_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_stack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sum_to_size_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_to_sparse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___getitem___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rmod___cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_acosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_any_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_chunk_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clone_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_deg2rad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_no_rounding_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_exp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_flip_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_le_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_mul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_narrow_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reshape_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_round_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_logit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_trace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unfold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unsqueeze_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_acosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addmm_decomposed_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_arange_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_broadcast_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ceil_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cumsum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_div_no_rounding_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_hfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fliplr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flipud_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_grid_sampler_2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_i0_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lgamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigvalsh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_inv_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_ldl_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_tensorinv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_vander_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logdet_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lu_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mT_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_median_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_min_reduction_no_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_movedim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_full_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_dropout_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_grid_sample_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_nearest_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_linear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_circular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_rrelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softplus_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_norm_nuc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_positive_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_quantile_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resize__cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resolve_conj_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rsub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_select_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_bartlett_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_nuttall_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_slice_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_j1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_laguerre_polynomial_l_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_i0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_k0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_xlog1py_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_squeeze_multiple_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tensordot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trapezoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_zero__cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_zeros_like_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argsort_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argwhere_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_not_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_right_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_heaviside_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_kron_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_solve_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_soft_margin_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_airy_ai_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_j1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_xlog1py_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cauchy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_dropout_backward_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_repeat_interleave_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_triu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_add_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_glu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_nuc_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pca_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_flipud_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_geqrf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_fill_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_not_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_linear_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_j1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_take_along_dim_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tensordot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_all_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_byte_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flip_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_gt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hstack_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logaddexp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_group_norm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randint_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_multiple_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_triu_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_zeros_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int64, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex32, test/test_ops.py::TestTagsCUDA::test_tags___rsub___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_bool_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_short_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_allclose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_chunk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clamp_max_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_conj_physical_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_dstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_irfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fliplr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_istft_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags__refs_lerp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_diagonal_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_svd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logical_not_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_zeros_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nextafter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_round_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_rsub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sinc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_squeeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_std_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tril_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_triu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_trunc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_xlogy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argsort_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bernoulli_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bfloat16_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_bitwise_right_shift_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_ceil_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cfloat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_chunk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_eq_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_equal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erfc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_exp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ihfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_rfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_rfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_full_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_full_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_hsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isreal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ldexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lgamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_householder_product_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_pinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logaddexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_argmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_mean_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_masked_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_std_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_multinomial_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_narrow_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_native_layer_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ne_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nextafter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_celu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_linear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pixel_unshuffle_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_pinverse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_y1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_log_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_xlog1py_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_square_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_trunc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unsqueeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_var_mean_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_zero__cuda_float32 2024-04-03T12:23:49.1438603Z 2024-04-03T12:23:49.1439244Z Running test_ops 8/10 ... 
[2024-04-03 12:23:48.778092] 2024-04-03T12:23:49.1441206Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=8', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:23:48.778640] 2024-04-03T12:32:43.0222035Z 2024-04-03T12:32:43.0229738Z test_ops 5/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_5.10_d6135ccdc22f06c9_.log 2024-04-03T12:32:43.2329593Z Running 3202 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_normal__in_place_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_geqrf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_qr_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mT_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unsqueeze_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing__chunk_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_angle_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cdouble_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_flatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_float_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_mH_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv_transpose1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_repeat_interleave_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unflatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rmod___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_allclose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atanh_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_conj_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_expand_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nan_to_num_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_entr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_spherical_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sub_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unfold_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_amax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_byte_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cartesian_prod_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_cat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_combinations_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cov_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_gt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_binary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_matrix_rank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_matrix_rank_hermitian_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_solve_triangular_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_vander_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lu_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_msort_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_binary_cross_entropy_with_logits_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv3d_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_nearest-exact_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool3d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multilabel_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_resolve_neg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_round_decimals_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sort_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_laguerre_polynomial_l_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_legendre_polynomial_p_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_log_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_modified_bessel_i1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_scaled_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_std_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_std_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_take_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tensor_split_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tensordot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_to_sparse_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tril_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unique_consecutive_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unsqueeze_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rpow___cuda, test/test_ops.py::TestCommonCUDA::test_errors_atan2_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_le_cuda, test/test_ops.py::TestCommonCUDA::test_errors_min_binary_cuda, test/test_ops.py::TestCommonCUDA::test_errors_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ne_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_embedding_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_scatter_add_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_gaussian_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_kaiser_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout4_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_legendre_polynomial_p_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices_T_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmul___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bmm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_histc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_4inputs_with_extra_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_binary_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_unary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mH_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanmedian_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool2d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softsign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nonzero_static_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nonzero_static_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_put_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sign_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tile_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___getitem___cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___rand___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___rmul___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___rxor___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bfloat16_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_conj_physical_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_digamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_dsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_rfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isfinite_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isnan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_binary_return_by_ref_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_kron_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_mean_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_max_reduction_no_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pad_constant_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_permute_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reshape_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_scatter_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_laguerre_polynomial_l_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___getitem___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___getitem___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rxor___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_all_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_baddbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cov_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumulative_trapezoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dist_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dstack_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_inner_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eig_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_householder_product_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_no_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_maximum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_msort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_trilinear_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_circular_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_static_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_fro_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_put_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_airy_ai_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensordot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tril_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_equal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vander_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_native_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_blackman_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_exponential_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tile_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_abs_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_smooth_l1_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isfinite_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softmin_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_inner_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_ex_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_det_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lstsq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_ndtri_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_split_with_sizes_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tensordot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_T_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_double_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_int_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atleast_2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_right_shift_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_erf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_expand_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_eye_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_le_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_meshgrid_list_of_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hardshrink_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_smooth_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_square_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_std_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__softmax_backward_data_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_acos_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_allclose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_argsort_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cholesky_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cholesky_inverse_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_expand_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_int_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_2inputs_2outputs_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_cond_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_det_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_long_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_std_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_matrix_exp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_embedding_bag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_feature_alpha_dropout_with_train_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_nearest-exact_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_linear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_logsigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool1d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_one_hot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_scaled_dot_product_attention_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_triplet_margin_loss_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nonzero_static_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_nuc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_outer_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_positive_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_roll_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_neg_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sgn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_blackman_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_polygamma_special_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sqrt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tensordot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trapezoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unflatten_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unfold_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unique_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_var_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_gelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_gelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mish_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softplus_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_dot_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_floor_divide_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_gt_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_triu_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_native_layer_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_group_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_polar_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frac_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nextafter_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_huber_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_huber_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_number_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcmul_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_any_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfft2_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_2inputs_2outputs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_binary_return_by_ref_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vander_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_full_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_rms_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ormqr_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resize__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resize_as__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scalar_tensor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sum_to_size_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_where_cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_half_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_kron_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mT_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_nearest-exact_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sub_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_eq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_fill_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isposinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_det_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_narrow_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_randint_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_modified_bessel_i0_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_gt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_int_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isin_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_pool1d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_slice_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unbind_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addmm_decomposed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_shapes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ifftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_gather_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mT_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_layer_norm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randint_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_where_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_aminmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_any_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_flip_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mH_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rad2deg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randn_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resolve_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_select_scatter_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_zeros_like_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_double_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_long_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_short_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_acosh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_allclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_conj_physical_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_expand_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_hfft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_irfft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isreal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log1p_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_ravel_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sin_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_special_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unfold_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_view_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_asin_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_atanh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_atleast_1d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_bool_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_char_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cholesky_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cosh_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_cumsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diff_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_double_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_einsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_expand_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_expm1_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fftshift_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_irfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_flatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isnan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_unary_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cholesky_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logical_xor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_long_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_normalize_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_matmul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ne_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_ones_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_reflect_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nonzero_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ones_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_randn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sqrt_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_squeeze_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unfold_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unsafe_chunk_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_H_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___radd___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__chunk_cat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_acos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_addr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_clone_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diag_embed_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_flip_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_flipud_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linspace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_ne_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_pow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_ravel_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_real_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reciprocal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_rsub_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sgn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_std_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_view_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_abs_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addcmul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_bmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_chunk_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cross_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diagonal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_half_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_imag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cholesky_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cond_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_det_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_factor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_solve_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log1p_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_and_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mH_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_replicate_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_permute_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_pinverse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rand_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ravel_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_reciprocal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_repeat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_with_sizes_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_with_sizes_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_stft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_t_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_trace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unbind_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view__native_batch_norm_legit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_long_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_arange_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atleast_2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_bucketize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clamp_min_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_exponential_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_flipud_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_hypot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_cross_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_or_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_minimum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_empty_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_mish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_normal_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_normal_number_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_pow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_randn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_repeat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reshape_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sinc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_i0e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tensor_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tril_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_vstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__segment_reduce_lengths_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_angle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_copysign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagonal_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_div_floor_rounding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_exp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_expand_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_eye_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fftshift_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_float_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_frac_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hypot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_copy_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_binary_return_by_ref_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_unary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_softmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_std_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_max_reduction_with_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_minimum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_native_dropout_backward_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_avg_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_cross_entropy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_embedding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_instance_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_constant_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pairwise_distance_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_relu6_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nonzero_static_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_normal_in_place_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pca_lowrank_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polar_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rad2deg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_randint_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_reshape_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sgn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_short_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sigmoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_gaussian_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sinc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_erfcx_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_hermite_polynomial_he_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_i1e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_std_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_take_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tile_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unique_consecutive_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_unsafe_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_as_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___rand___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rand___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cauchy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cfloat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_copysign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_eq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_gradient_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_gt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_half_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_not_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softsign_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_short_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_blackman_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_entr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tensordot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_uniform_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_left_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_chunk_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_count_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_solve_triangular_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_gelu_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_outer_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addr_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_fft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logaddexp2_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multi_margin_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tensor_split_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_empty_permuted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_heaviside_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logaddexp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nanmean_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_normal_in_place_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tril_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_triu_indices_cuda_int64, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unique_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_zeros_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_H_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_right_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_eq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ge_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_item_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_long_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_multinomial_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_ctc_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nonzero_static_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_zeta_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tril_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_complex32, test/test_ops.py::TestTagsCUDA::test_tags___rmod___cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_and_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_broadcast_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_dsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_fft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_heaviside_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_igamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_igammac_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logical_or_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_lt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_neg_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_empty_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_full_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_positive_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_reciprocal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_reshape_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addcdiv_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_addmm_decomposed_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_all_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_any_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bucketize_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cdist_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_clamp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cumprod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagonal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dist_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erfinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_rfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_gcd_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_half_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_item_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_kthvalue_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cholesky_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_eigvals_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lu_factor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_solve_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_normal_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_logaddexp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logcumsumexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_min_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_full_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_dropout_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_gelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pdist_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_ones_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resolve_neg_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scalar_tensor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_y0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_triangular_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unbind_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unique_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_vsplit_cuda_float32 2024-04-03T12:32:43.3991127Z 2024-04-03T12:32:43.3991747Z Running test_ops 9/10 ... [2024-04-03 12:32:43.032068] 2024-04-03T12:32:43.3994452Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=9', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2024-04-03 12:32:43.032551] 2024-04-03T12:33:39.6421763Z 2024-04-03T12:33:39.6425152Z test_ops 7/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_7.10_42fb8f0437415d49_.log 2024-04-03T12:33:39.8498771Z Running 3299 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_div_floor_rounding_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_rrelu_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_block_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sgn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sin_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_split_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unfold_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rand___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_T_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_short_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_acosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_all_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_asinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_equal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ne_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_empty_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_smooth_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_signbit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sqrt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_std_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_aminmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_any_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argsort_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atleast_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bernoulli_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ceil_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_geqrf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_gradient_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_grid_sampler_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_i0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_reduce_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_unary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_kthvalue_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ldexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logdet_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_std_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_var_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_native_batch_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_embedding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_area_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_linear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool2d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_mish_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_circular_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_pinverse_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_4_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_qr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randint_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_resize__cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_amax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_i1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_svd_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__efficient_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__flash_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_where_cuda, test/test_ops.py::TestCommonCUDA::test_errors___radd___cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_errors_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logcumsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_select_cuda, test/test_ops.py::TestCommonCUDA::test_errors_mean_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_hardtanh_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ormqr_cuda, test/test_ops.py::TestCommonCUDA::test_errors_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_errors_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout0_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices___radd___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_aminmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argwhere_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_copy_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_binary_return_by_ref_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lgamma_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vander_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_batch_norm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amax_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_as_complex_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_aminmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_contiguous_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fmin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_unary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nan_to_num_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ones_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reshape_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_rsqrt_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_entr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_hermite_polynomial_h_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_scaled_modified_bessel_k0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_with_sizes_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_view_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___ror___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_any_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bincount_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_combinations_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumulative_trapezoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gradient_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_histc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_unary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_norm_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_var_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_median_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool1d_grad_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_4_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_remainder_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_uniform_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_float32, test/test_ops.py::TestCommonCUDA::test_numpy_ref_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diff_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_numpy_ref_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_searchsorted_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_cosine_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tile_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_neg_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argwhere_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_integral_dtype__refs_prod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_out_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_diagonal_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_repeat_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_kron_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cond_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eig_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rsqrt_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tensor_split_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_byte_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_float_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_long_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_polar_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_all_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_allclose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_any_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atleast_1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_gt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_minimum_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_positive_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rad2deg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_log_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_logit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cartesian_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clamp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cumulative_trapezoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_einsum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_empty_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ge_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_kron_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_det_singular_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_hermitian_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_singular_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_min_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nanquantile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nansum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ne_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_nearest_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_selu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_smooth_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_4_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randint_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randn_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resolve_conj_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_legendre_polynomial_p_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_spherical_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_stft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_torch__scaled_mm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_torch_ops_aten__efficient_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_triangular_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unique_consecutive_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unsafe_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unsafe_split_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_prelu_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eq_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_ge_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hinge_embedding_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_decomposed_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_geqrf_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_matrix_exp_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_shuffle_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_static_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tan_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cond_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_group_norm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tile_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_floor_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout2d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rdiv___cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_geqrf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lgamma_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_embedding_bag_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_reshape_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_with_sizes_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft2_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_matmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_min_reduction_with_dim_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_upsample_nearest_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randn_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resolve_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_topk_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unique_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_equal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isreal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_qr_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_trilinear_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_bessel_j1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_square_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vdot_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rsub___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_T_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_float_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_abs_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_chunk_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_clone_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cosh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_imag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_lerp_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_and_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_neg_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_positive_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sgn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_t_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_var_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_vsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_acos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addcdiv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_byte_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_count_nonzero_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_equal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_exp2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_eye_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_factor_ex_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vander_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vecdot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logcumsumexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logdet_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lu_unpack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mH_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nanmean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_circular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_replicate_negative_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ravel_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_roll_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_rsub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sum_to_size_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_trapz_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_triangular_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tril_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_triu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_T_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_bfloat16_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_byte_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_cdouble_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_long_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_chunk_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_column_stack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_conj_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_exp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_imag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_narrow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reshape_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_var_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addcdiv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addr_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cfloat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_conj_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_count_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumulative_trapezoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dist_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_flip_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_float_power_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_put_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_int_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isinf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_item_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cholesky_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_householder_product_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_hermitian_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_singular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_slogdet_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_tensorsolve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linspace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logspace_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mT_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_cumprod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_narrow_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv3d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_inf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_nuc_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_slice_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_transpose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unflatten_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_mean_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_abs_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_acos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atleast_1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_copysign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cumprod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_eq_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expm1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_irfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_igamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_igammac_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isfinite_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_lgamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log10_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logaddexp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_neg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_huber_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_normal__in_place_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_positive_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ravel_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reciprocal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_i1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_ndtri_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_squeeze_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_take_along_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_to_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_where_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_xlogy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_allclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_argsort_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_partial_views_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_asin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cartesian_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_max_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cumprod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flip_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_full_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ge_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_half_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isfinite_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lerp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigvals_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_ldl_factor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_rank_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_hermitian_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_slogdet_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log1p_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_logaddexp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logcumsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_not_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_softmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nanmean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_native_batch_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ne_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_embedding_bag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_reflect_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_soft_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_upsample_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ones_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ormqr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_permute_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_round_decimals_0_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_general_cosine_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_slice_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_v_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_ndtri_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_zeta_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tril_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unfold_copy_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_aminmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argwhere_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_left_shift_cuda_int64, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bucketize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isneginf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_unary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_le_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eigh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_xor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_grid_sample_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nonzero_static_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randint_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_cosine_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_y1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zeros_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_baddbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cfloat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cummin_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_gradient_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mul_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rsqrt_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cfloat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cumprod_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_flipud_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanmean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool1d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_equal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isposinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mT_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ne_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_glu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randint_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randint_like_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_resize_as__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_as_real_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_allclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_aminmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argsort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_not_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bucketize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cholesky_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_count_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_full_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_geqrf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_imag_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_reduction_no_dim_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ne_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pca_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rad2deg_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resize_as__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_complex64, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float64, test/test_ops.py::TestTagsCUDA::test_tags___rxor___cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_double_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_addcdiv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atleast_3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ceil_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cumsum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_empty_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_eq_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_erfinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_exp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expm1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_frexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_gt_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_hsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isreal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_normal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_repeat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sigmoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_var_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_where_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argwhere_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_1d_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_cholesky_inverse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_count_nonzero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cross_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_empty_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expm1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_irfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_flatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_reduce_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cross_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_power_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_multi_dot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_tensorinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mH_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_logsumexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_median_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nan_to_num_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_embedding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_prelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softplus_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_threshold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_in_place_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rsqrt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sgn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_slice_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_j0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_entr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_i1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_mean_unbiased_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_stft_cuda_float32
2024-04-03T12:33:40.0342440Z
2024-04-03T12:33:40.0343667Z Running test_ops 10/10 ... [2024-04-03 12:33:39.651223]
2024-04-03T12:33:40.0345756Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_ops.py', '--shard-id=10', '--num-shards=10', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:33:39.652044]
2024-04-03T12:34:12.5676070Z
2024-04-03T12:34:12.5679689Z test_ops 8/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_8.10_abe63000578516da_.log
2024-04-03T12:34:12.7584887Z Running 3115 items in this shard: test/test_ops.py::TestSelfKwarg::test_self_kwargs, test/test_ops.py::TestCommonCUDA::test_compare_cpu___getitem___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___ror___cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linspace_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nonzero_static_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_reshape_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_zero__cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_bool_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_column_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_eq_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_mT_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_empty_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv_transpose2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_positive_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_pow_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_split_with_sizes_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes_T_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_cdouble_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atan_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_expand_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_permute_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sgn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_i1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_log_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tril_indices_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unsqueeze_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__segment_reduce_lengths_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_acos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_allclose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argwhere_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bincount_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cdouble_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_contiguous_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expand_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_eye_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_put_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_inner_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_istft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_item_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cholesky_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_norm_subgradients_at_zero_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_slogdet_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_solve_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_tensorsolve_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_matmul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_pool2d_with_indices_backward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_meshgrid_list_of_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_multinomial_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_cosine_similarity_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_fractional_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hardshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_nearest_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_trilinear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pixel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_upsample_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_fro_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_in_place_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_outer_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randint_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randn_like_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_round_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rsqrt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_kaiser_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sparse_sampled_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_entr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_split_list_args_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_split_with_sizes_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tile_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_topk_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unravel_index_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unsafe_split_cuda, test/test_ops.py::TestCommonCUDA::test_errors_T_cuda, test/test_ops.py::TestCommonCUDA::test_errors_arange_cuda, test/test_ops.py::TestCommonCUDA::test_errors_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_errors_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_errors_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_errors_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_errors_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_errors_item_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linalg_lstsq_grad_oriented_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_multilabel_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_general_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout4_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_errors_trace_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices___getitem___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__chunk_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bincount_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bool_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_combinations_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumulative_trapezoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_frexp_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_select_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_reflect_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_roll_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_legendre_polynomial_p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_sparse_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_T_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___rdiv___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bool_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cfloat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_chunk_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagflat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_2inputs_2outputs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_lgamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_narrow_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_4_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_erfcx_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_i1e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_xlog1py_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unfold_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unique_consecutive_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unsafe_chunk_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_aminmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argsort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argwhere_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_corrcoef_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expm1_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_2inputs_2outputs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_ex_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vander_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_std_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_embedding_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_static_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_interleave_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_numpy_ref_aminmax_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clone_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_roll_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_kaiser_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_view_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_out___getitem___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lerp_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bincount_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_deg2rad_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_poisson_nll_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diff_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_unpack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_nuc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_gaussian_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_view_as_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_warning___rpow___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rsub___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_bool_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_cfloat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clone_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_contiguous_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_floor_divide_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_item_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_pixel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_spherical_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unsqueeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bernoulli_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bitwise_not_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_equal_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_flip_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_half_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_reduce_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_le_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_factor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_solve_triangular_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logical_not_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lu_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_median_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mode_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_neg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_batch_norm_without_cudnn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_cross_entropy_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_ctc_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_area_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_linear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_mish_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_circular_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_reflect_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_relu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pinverse_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_quantile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resize__cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_select_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_kaiser_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_entr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_list_args_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_with_sizes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_svd_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_take_along_dim_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_tensor_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_var_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_view_as_real_cuda, test/test_ops.py::TestCommonCUDA::test_out_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_left_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cauchy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cauchy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_bitwise_and_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diag_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_le_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_shapes_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_imag_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_istft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_native_layer_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_group_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_gelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nextafter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_celu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_huber_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softplus_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmul___cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argwhere_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_block_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_inverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_combinations_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumulative_trapezoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagflat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_unary_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vander_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_matrix_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mv_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nansum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_slice_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsafe_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vstack_cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_amax_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_cross_entropy_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_slice_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifft_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isneginf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv2d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_prod_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addmm_decomposed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_aminmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_copysign_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_exp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_prod_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_movedim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nextafter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_real_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_searchsorted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_zeros_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_exp2_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_int_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_det_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardsigmoid_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_reshape_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signbit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_slice_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unsafe_split_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_heaviside_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_int_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lu_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resize_as__cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unique_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unsafe_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_xlogy_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view___radd___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rpow___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_bool_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addcdiv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_asin_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_asinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cumprod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_irfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isfinite_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isinf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_svd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_xor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_narrow_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_renorm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_view_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_where_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addmm_decomposed_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_all_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_asinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_baddbmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cartesian_prod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_conj_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diag_embed_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_expand_as_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_hfft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_float_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_full_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_inner_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_int_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_inv_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_factor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_rank_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_pinv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_qr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_constant_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_fro_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ones_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reciprocal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_rot90_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_rsqrt_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_scalar_tensor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_slice_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_with_sizes_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_unbiased_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_svd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_t_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unflatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_uniform_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_vdot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_zero__cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rmul___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_acosh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cosh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_expm1_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_item_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logspace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_movedim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_randn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_rsqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_special_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_stack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sub_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_triu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_vsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_vstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_asinh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_3d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_broadcast_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_exp2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_hstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_istft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cross_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_eig_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_qr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logdet_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ne_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_reshape_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resize_as__cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_roll_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_select_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sigmoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tensor_split_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_true_divide_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_uniform_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_vdot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_as_real_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_zero__cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rdiv___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rmul___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rpow___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_half_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_addr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_broadcast_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cauchy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cumsum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ifft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ihfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ihfft_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_float_power_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linspace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logaddexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logspace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_lt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ne_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rad2deg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_real_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_std_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sum_to_size_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_view_as_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_abs_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addcdiv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_argwhere_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_asinh_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_baddbmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bernoulli_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_byte_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_conj_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagflat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_eq_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_erfc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_floor_divide_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isinf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isnan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isreal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cross_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_multi_dot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_or_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logspace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_log_softmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_var_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_max_binary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nextafter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool3d_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_dropout2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_huber_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multi_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_normalize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_replicate_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pdist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softsign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_outer_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_randn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_renorm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_hamming_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signbit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sort_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_airy_ai_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_y1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_w_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_i0e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_w_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_split_with_sizes_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sqrt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_std_mean_unbiased_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unflatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_where_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_xlogy_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rxor___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_all_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_H_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_baddbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_max_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_full_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_imag_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isfinite_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isnan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_item_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_matrix_exp_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ones_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_pca_lowrank_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tril_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_triu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bincount_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_not_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_byte_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rpow___cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eigh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_avg_pool2d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rmod___cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_median_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_slice_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_squeeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tensordot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_triu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_view_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_deg2rad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_empty_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_gcd_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_gt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_xor_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_long_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rand_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_short_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_i1_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_uniform_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rand___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rxor___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cfloat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_eye_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_floor_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_gcd_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_half_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_histc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eigh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_avg_pool3d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_blackman_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_softmax_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_xlogy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_complex128, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_bfloat16, test/test_ops.py::TestTagsCUDA::test_tags_H_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___radd___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___rpow___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__chunk_cat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_T_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_abs_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_bucketize_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_equal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_hfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_gcd_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isposinf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logsumexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_minimum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nan_to_num_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_narrow_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_normal__in_place_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_pow_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_i0e_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_std_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sum_to_size_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unfold_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_vstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addcmul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_and_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_broadcast_shapes_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_chalf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_clamp_min_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_contiguous_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cummin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diagonal_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_digamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_einsum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_fft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_fftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ihfft_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_fft_ihfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_float_power_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_frac_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_heaviside_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_igamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_igammac_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isinf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_unary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_det_singular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_svdvals_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_vector_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log10_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_argmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_cumprod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_cumsum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_narrow_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_empty_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_replicate_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_relu6_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rad2deg_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_hann_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_slice_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_ndtri_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_squeeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_svd_lowrank_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_topk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_trapezoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unfold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_uniform_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_var_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_where_cuda_float32
2024-04-03T12:34:12.9207795Z
2024-04-03T12:34:12.9208426Z Running test_unary_ufuncs 1/1 ... [2024-04-03 12:34:12.577201]
2024-04-03T12:34:12.9210527Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_unary_ufuncs.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-04-03 12:34:12.577812]
2024-04-03T12:37:36.0994355Z
2024-04-03T12:37:36.1002747Z test_ops 6/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_6.10_4e3eaa86326da093_.log
2024-04-03T12:37:36.3046617Z Running 3262 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ldexp_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_slice_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_partial_views_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nonzero_static_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_ones_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_prod_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rand_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_real_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_resolve_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_split_with_sizes_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sub_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_trace_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsafe_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_vsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes_H_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___rpow___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_abs_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_erf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_flip_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_cross_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_real_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__segment_reduce_offsets_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_abs_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_baddbmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cholesky_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cummin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_deg2rad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagonal_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_histc_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_ldl_factor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_qr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mT_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nanquantile_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ne_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_feature_alpha_dropout_without_train_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_grid_sample_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool1d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_margin_loss_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multilabel_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pixel_unshuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_rms_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_silu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_triplet_margin_with_distance_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nonzero_static_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_nuc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_y0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_y1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_polygamma_special_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_split_with_sizes_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_stft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_to_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rand___cuda, test/test_ops.py::TestCommonCUDA::test_errors___rmod___cuda, test/test_ops.py::TestCommonCUDA::test_errors__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_errors_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_errors_cat_cuda, test/test_ops.py::TestCommonCUDA::test_errors_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gt_cuda, test/test_ops.py::TestCommonCUDA::test_errors_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_gaussian_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_normal_in_place_cuda, test/test_ops.py::TestCommonCUDA::test_errors_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__native_batch_norm_legit_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argwhere_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diff_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_copy_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_put_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mT_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_native_dropout_backward_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sparse_sampled_addmm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_list_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_topk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_H_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___radd___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_chalf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cummin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_permuted_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_int_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_empty_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pad_circular_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nonzero_static_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_3_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tile_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_to_sparse_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_baddbmm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfftn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gather_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geqrf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_4inputs_with_extra_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kthvalue_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vecdot_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv2d_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_embedding_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softsign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_threshold_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_inf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sign_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_topk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_consecutive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorsolve_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorsolve_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vecdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_svdvals_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_exp_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_vander_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool1d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nonzero_static_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_baddbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dstack_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___radd___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rmod___cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning___rmul___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_expand_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_floor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_imag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_svdvals_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_mish_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_selu_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_normal__in_place_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_arange_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bool_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_eye_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_floor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_frac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_gradient_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_heaviside_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eig_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_inv_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_multi_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_norm_subgradients_at_zero_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_solve_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lu_unpack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_median_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_meshgrid_list_of_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_min_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_msort_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_binary_cross_entropy_with_logits_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_cosine_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardsigmoid_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_bicubic_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_trilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_negative_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_relu6_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ormqr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pca_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polar_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_repeat_interleave_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_bartlett_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_hann_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_scaled_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trapz_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_vstack_cuda, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_group_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_huber_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_T_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_atan2_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cat_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eye_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_pow_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_t_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_igammac_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_indices_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igammac_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_H_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___getitem___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmatmul___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__chunk_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_baddbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diff_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gradient_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cond_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_singular_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_constant_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_slice_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addbmm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logaddexp2_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_where_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_aminmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_any_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_irfftn_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_gt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_ctc_loss_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_prod_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unique_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cdouble_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_igamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isreal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_rank_hermitian_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pixel_unshuffle_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ones_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rand_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rsub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_2d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_eye_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_median_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randint_like_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmm_decomposed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_angle_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_select_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_leaky_relu_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_transpose_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_zeros_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view___getitem___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rmatmul___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_any_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_exp2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fliplr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_hstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_rsub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sigmoid_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_triu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unflatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unsqueeze_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_argwhere_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_block_diag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_broadcast_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cholesky_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cholesky_inverse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_corrcoef_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dist_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_eq_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_hfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifftshift_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_irfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_flipud_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_gather_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_geqrf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_gradient_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_hstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isfinite_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_factor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_svdvals_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_cumprod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_cumsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_sum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_matrix_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_narrow_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv_transpose2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_replicate_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nonzero_static_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_nuc_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_normal_in_place_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_qr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_repeat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_repeat_interleave_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reshape_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_scatter_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_select_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_stft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_transpose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unsqueeze_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_where_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_addcmul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_allclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_asin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_istft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_cross_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_and_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_xor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_masked_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_neg_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_t_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_bool_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cartesian_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_char_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cholesky_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_combinations_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_eye_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_geqrf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isfinite_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_inv_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lstsq_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_solve_triangular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_not_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_or_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_movedim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_narrow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_circular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_softsign_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_outer_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_positive_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_qr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_repeat_interleave_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rsqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sgn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_svd_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_take_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tensordot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsafe_chunk_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rmatmul___cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__batch_norm_with_update_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_char_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_int_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_polar_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_allclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_asinh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atan2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clamp_max_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_column_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_equal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erfc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_hfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_lerp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_svd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_vecdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_and_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rsub_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sqrt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_square_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_std_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_transpose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_trunc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__upsample_bilinear2d_aa_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atleast_1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_min_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diff_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_erfinv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_rfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_floor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_full_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_geqrf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_int_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lstsq_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_factor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_singular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_svd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log10_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_argmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_argmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_matrix_exp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_maximum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_min_binary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_avg_pool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_celu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_gelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_kl_div_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_pool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_upsample_nearest_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_norm_fro_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_4_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_real_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_repeat_interleave_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resize_as__cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rot90_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_searchsorted_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_exponential_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_entr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_log_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_stft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_take_along_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_vdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_as_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_copy_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___ror___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_aminmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_eye_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_igamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logaddexp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_bicubic_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_in_place_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resize__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scalar_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signbit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_true_divide_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unique_consecutive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_char_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_gather_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_scaled_dot_product_attention_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_xlogy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_constant_pad_nd_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_det_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_nll_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_flip_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_igammac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_unary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lcm_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_ldexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_and_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv1d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_transpose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rdiv___cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_broadcast_shapes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cauchy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_char_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_permuted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flipud_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isneginf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linspace_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_or_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_narrow_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_mse_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_entr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_scaled_modified_bessel_k1_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unique_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float16, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestTagsCUDA::test_tags___rdiv___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_asinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clamp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clamp_min_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clone_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_contiguous_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_copysign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diag_embed_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_erfc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_floor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_floor_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_frac_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_hstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isnan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_maximum_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_rad2deg_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_ndtri_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_view_as_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_view_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_acosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addbmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addmv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_arange_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_asin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_broadcast_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_char_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cholesky_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cummax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_deg2rad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_double_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_exponential_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_put_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_inner_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isclose_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_le_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_and_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_xor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_softmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_matrix_exp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_neg_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_zeros_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_mish_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_selu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softsign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_unfold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_pca_lowrank_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_permute_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_put_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randint_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_reciprocal_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_reshape_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_reshape_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resize__cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_roll_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_blackman_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_cosine_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_j1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_squeeze_multiple_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_take_along_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_to_sparse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_trapz_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tril_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_vdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_zeros_like_cuda_float32 2024-04-03T12:37:36.4621454Z 2024-04-03T12:37:36.4622069Z Running test_binary_ufuncs 1/1 ... [2024-04-03 12:37:36.109391] 2024-04-03T12:37:36.4624119Z Executing ['/opt/conda/envs/py_3.8/bin/python', '-bb', 'test_binary_ufuncs.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2024-04-03 12:37:36.110000] 2024-04-03T12:40:26.9579807Z 2024-04-03T12:40:26.9583644Z test_binary_ufuncs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_binary_ufuncs_1.1_f1b11807bec3c907_.log 2024-04-03T12:40:27.7356954Z Running 12830 items in this shard: test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_add_broadcast_empty_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_add_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_add_with_tail_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_addcmul_scalars_as_floats_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_addsub_half_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_atan2_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_atan2_edgecases_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_complex_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_binary_op_mem_overlap_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_binary_op_scalar_device_unspecified_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_binary_ops_with_scalars_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bool_tensor_comparison_ops_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_float_power_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_xor_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_max_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_minimum_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cdiv_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cmul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_div_underflow_overflow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_div_underflow_overflow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_polar_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_complex32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_polar_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_complex_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igammac_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_complex_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_polar_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cpow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cpu_tensor_pow_cuda_scalar_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cremainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cross_device_binary_ops_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cross_device_inplace_error_msg_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_csub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cuda_tensor_pow_scalar_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cumulative_trapezoid_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_and_floordiv_script_vs_python_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_and_floordiv_vs_python_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divmul_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_exceptions_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_scalar_pow_float_tensor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_scalar_pow_float_tensor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_float_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_float_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_float_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cross_device_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_idiv_and_ifloordiv_vs_python_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_inplace_division_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_inplace_dunders_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_int_and_float_pow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_int_tensor_pow_neg_ints_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_ldexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cpu_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cpu_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp2_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_with_nontrivial_alignment_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_long_tensor_pow_floats_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_cross_device_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_forward_ad_float32_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_min_max_binary_op_nan_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_min_max_binary_op_nan_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_min_max_binary_op_nan_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_chalf_tensor_and_cpu_scalar_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_intertype_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_intertype_scalar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_intertype_scalar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_nextafter_bfloat16_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_complex_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_polar_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_complex32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_complex32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_complex_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_complex32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_float_power_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_minimum_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_div_trunc_rounding_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logical_or_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_hermite_polynomial_he_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_out_resize_warning_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex_extremal_failing_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex_extremal_failing_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_inplace_resizing_exception_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_scalar_base_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_scalar_overloads_mem_overlap_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_scalar_type_promotion_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_complex32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_remainder_fmod_large_dividend_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_remainder_fmod_large_dividend_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_remainder_overflow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rpow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_eq_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_lcm_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ne_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_eq_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_return_by_ref_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_min_binary_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_typing_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_tensor_pow_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_trapezoid_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_true_divide_out_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_true_divide_out_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___radd___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rand___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rdiv___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rmod___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rmul___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___ror___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rpow___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rsub___cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rxor___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs__conversions_complex_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs__conversions_polar_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_add_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_atan2_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_left_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_right_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_clamp_max_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_clamp_min_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_copysign_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_div_floor_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_div_no_rounding_mode_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_div_trunc_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_eq_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_float_power_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_floor_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_fmax_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_fmin_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_fmod_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_gcd_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_ge_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_gt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_heaviside_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_hypot_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_igamma_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_igammac_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_isclose_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_lcm_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_le_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logaddexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logical_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logical_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logical_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_lt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_maximum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_minimum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_mul_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_ne_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_nextafter_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_pow_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_remainder_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_rsub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_special_xlog1py_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_special_zeta_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_sub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_true_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_xlogy_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_add_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_atan2_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_left_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_right_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_clamp_max_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_clamp_min_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_complex_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_copysign_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_div_floor_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_div_no_rounding_mode_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_div_trunc_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_eq_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_float_power_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_floor_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_fmax_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_fmin_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_fmod_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_gcd_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_ge_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_gt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_heaviside_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_hypot_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_igamma_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_igammac_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_isclose_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_jiterator_binary_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_jiterator_binary_return_by_ref_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_lcm_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_ldexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_le_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logaddexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logical_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logical_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logical_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_lt_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_max_binary_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_maximum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_min_binary_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_minimum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_mul_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_ne_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_nextafter_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_polar_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_pow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_remainder_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_rsub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_t_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_u_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_v_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_w_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_hermite_polynomial_h_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_hermite_polynomial_he_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_laguerre_polynomial_l_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_legendre_polynomial_p_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_t_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_u_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_v_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_w_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_xlog1py_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_zeta_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_sub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_true_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_xlogy_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_bfloat16_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_gradients_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_scalar_type_promotion_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_uint8 2024-04-03T12:40:28.4888864Z 2024-04-03T12:41:02.1767870Z 2024-04-03T12:41:02.1771718Z test_unary_ufuncs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_unary_ufuncs_1.1_91fe2da0719b659f_.log 2024-04-03T12:41:03.6306302Z Running 24947 items in this shard: test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_angle_complex_to_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_angle_complex_to_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_big_number_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_signed_zero_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_signed_zero_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_zero_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_zero_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_complex_edge_values_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_complex_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_complex_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_special_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_special_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_slow_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_assert_raises_cuda, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_common_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_common_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_common_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_common_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_log1p_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_log1p_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_bfloat16_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_complex_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_empty_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_op_invert_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_polygamma_neg_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtri_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_complex_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_erfcx_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_complex_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_complex_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_log_ndtr_vs_scipy_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_log_ndtr_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_ndtr_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_ndtr_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_unary_out_op_mem_overlap_cuda_float64 2024-04-03T12:41:05.0336919Z 2024-04-03T12:41:24.1992796Z 2024-04-03T12:41:24.1996464Z test_ops 9/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_9.10_9497e0f5766be958_.log 2024-04-03T12:41:24.3950055Z Running 3233 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_istft_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bincount_cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_triangular_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_u_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_char_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_ravel_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_tanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_zeros_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___ror___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_polar_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_amax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_arange_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_frac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_i0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_elu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pixel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pixel_unshuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_roll_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_take_along_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tan_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unflatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_all_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_asinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_erf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_full_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_histogram_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_int_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_binary_return_by_ref_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cholesky_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_matrix_power_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_sum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_matrix_exp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_min_binary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_min_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nanmean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_empty_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_new_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_batch_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_cross_entropy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_ctc_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_dropout3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_instance_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_reflect_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_smooth_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_pca_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rand_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_resolve_conj_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_select_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_short_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_hann_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_slice_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_split_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_triu_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_unfold_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_as_real_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rmul___cuda, test/test_ops.py::TestCommonCUDA::test_errors___rsub___cuda, test/test_ops.py::TestCommonCUDA::test_errors___rxor___cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_errors_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_histogramdd_cuda, test/test_ops.py::TestCommonCUDA::test_errors_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_errors_jiterator_binary_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_errors_median_cuda, test/test_ops.py::TestCommonCUDA::test_errors_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_errors_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_triplet_margin_with_distance_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout0_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout0_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout0_cuda, test/test_ops.py::TestCommonCUDA::test_errors_t_cuda, test/test_ops.py::TestCommonCUDA::test_errors_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_partial_views_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_conj_physical_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_kthvalue_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmin_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_var_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pdist_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize_as__cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_slice_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_slice_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_airy_ai_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_v_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_vsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values__chunk_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_angle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_as_strided_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_eq_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nansum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_cosine_embedding_loss_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_resolve_neg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_select_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_slice_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_y0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_hermite_polynomial_he_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_legendre_polynomial_p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_k1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tensor_split_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unsafe_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_zeros_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___getitem___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argwhere_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_partial_views_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_2d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cov_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_einsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gradient_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_2inputs_2outputs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logcumsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_reduction_no_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_strided_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_constant_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_constant_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_negative_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_negative_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_static_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polar_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_interleave_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtr_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_addbmm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diff_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorinv_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vander_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_rms_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_hamming_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_transpose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___ror___cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rxor___cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ifft_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_entr_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_special_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_unary_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_det_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acos_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_softmax_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_linear_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_inf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_airy_ai_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_triu_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___rand___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_equal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_flip_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_round_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rsqrt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sgn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_i1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_multigammaln_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_squeeze_multiple_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_t_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tril_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_acosh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atleast_1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_char_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_conj_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cos_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dist_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_gt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_inner_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ldexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lstsq_grad_oriented_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_svdvals_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log1p_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logcumsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_amax_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_min_binary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_multinomial_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nanmedian_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_batch_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv_transpose3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_mse_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pixel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_silu_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_silu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_upsample_nearest_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_fro_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rad2deg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rand_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_general_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_ndtri_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_svd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_view_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_where_cuda, test/test_ops.py::TestCommonCUDA::test_out_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_polar_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_add_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_arange_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_mean_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_sub_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_polar_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_igamma_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_native_layer_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_native_layer_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pdist_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pdist_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_polar_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_selu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lcm_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_celu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_number_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_angle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bool_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_combinations_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_physical_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_permuted_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_fill_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_kron_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_rank_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mH_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ne_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softsign_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_triplet_margin_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rand_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stft_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensordot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tile_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addmm_decomposed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_deg2rad_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_vecdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_binary_cross_entropy_with_logits_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rad2deg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_mean_unbiased_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_topk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flip_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isreal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_pinv_singular_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_vecdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mT_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_matmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_replicate_negative_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_resolve_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_searchsorted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_bessel_j1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_bessel_y0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_topk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unsafe_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_where_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_any_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_flip_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_frexp_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isposinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_narrow_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_dropout2d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randn_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signbit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__softmax_backward_data_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_flip_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_geqrf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isnan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_inv_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multilabel_margin_loss_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unsafe_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_eq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_erf_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_gt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_kron_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_narrow_copy_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nextafter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_triplet_margin_loss_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ones_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randint_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rsub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_slice_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_topk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_view_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_chalf_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_half_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atanh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_block_diag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_broadcast_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cumsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_dsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_dstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_hfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log10_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_movedim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_roll_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_rot90_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_rsqrt_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sinc_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_transpose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unfold_copy_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_vdot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_allclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_atan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_atleast_3d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_bfloat16_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_bmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_conj_physical_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cov_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_hfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_float_power_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_imag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_istft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_binary_return_by_ref_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cond_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eigh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_power_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_solve_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_svd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log1p_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lu_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_empty_strided_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv_transpose1d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_randn_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sigmoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sin_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_mean_unbiased_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_take_along_dim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_take_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_trace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_var_mean_unbiased_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_bool_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_float_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_all_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_asinh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_exp2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_eye_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_hfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_ifftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_flatten_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_float_power_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isnan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log10_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log1p_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_or_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reshape_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sigmoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_squeeze_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_var_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addmm_decomposed_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_baddbmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_broadcast_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_byte_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_conj_physical_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_equal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expm1_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifft_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_flipud_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_full_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_gradient_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_binary_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_unary_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_tensorinv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_long_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_sum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_var_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_linear_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ones_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_put_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_randn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resolve_conj_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resolve_neg_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scalar_tensor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sinc_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sinh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sparse_sampled_addmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_squeeze_multiple_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_zeros_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_addcdiv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_addcmul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_partial_views_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_contiguous_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diag_embed_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_dot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_exp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_eye_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_frexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_heaviside_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_i0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_fill_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isreal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_masked_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_gelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_permute_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_remainder_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sigmoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_signbit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_i1e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_triu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_true_divide_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unflatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_var_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_view_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_view_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_vsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_acos_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_addcmul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_all_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_argmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_char_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cholesky_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_column_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_combinations_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_count_nonzero_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cov_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_digamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_einsum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_exp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_hfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifftshift_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_heaviside_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_igamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_put_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_item_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_binary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_kthvalue_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ldexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_ldl_factor_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_factor_ex_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_norm_subgradients_at_zero_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_normalize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_max_pool2d_with_indices_backward_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_multinomial_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_narrow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_native_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_cosine_similarity_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_elu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_logsigmoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_replicate_negative_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_normal_number_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_round_decimals_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_prod_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_sign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_i1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_k1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_vsplit_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___ror___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bincount_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_broadcast_shapes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_char_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_hypot_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lcm_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ldexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_long_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ne_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_max_pool1d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_transpose_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unique_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_right_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_bucketize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_H_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_deg2rad_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_half_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_logsumexp_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_unfold_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_transpose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_H_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_angle_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_copysign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_half_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_svd_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_linear_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_transpose_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_floor_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_gradient_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_histc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_min_reduction_no_dim_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_pca_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scalar_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_select_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_fake_triu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___ror___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ceil_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_floor_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isfinite_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_kron_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_and_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_bilinear_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_normal_in_place_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ones_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resize__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_y1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_i0_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_uniform_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unique_consecutive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_complex32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int8, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_complex128, test/test_ops.py::TestTagsCUDA::test_tags_T_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___ror___cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_acos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_addcmul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_all_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_any_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atleast_2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_flip_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isinf_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ravel_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_real_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_rsqrt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_erfcx_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_zeros_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_acos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bincount_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_block_diag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cholesky_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_combinations_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_conj_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_copysign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_corrcoef_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_cos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diag_embed_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diff_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_fftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_hfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_hfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_float_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_frexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_gather_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ge_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_grid_sampler_2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_kron_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_eigh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mT_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_logaddexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_minimum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nanmean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nanquantile_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_avg_pool3d_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_normalize_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_silu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nonzero_static_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_fro_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_nuc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polar_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_pow_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_quantile_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rand_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_searchsorted_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_sigmoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_hamming_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_erfcx_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sum_to_size_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_take_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unsafe_chunk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unsafe_split_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_var_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_vstack_cuda_float32 2024-04-03T12:41:24.5516606Z 2024-04-03T12:44:05.7368897Z 2024-04-03T12:44:05.7375979Z test_ops 10/10 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_10.10_21964ae873e2930d_.log 2024-04-03T12:44:05.9256058Z Running 3088 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_log_normal_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_det_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_avg_pool2d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ones_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_view_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_H_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_acos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_diagonal_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nansum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_transpose_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unbind_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsafe_split_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rdiv___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__batch_norm_with_update_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_partial_views_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_asin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clamp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clone_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_contiguous_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_dot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_eq_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_eye_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_imag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_svdvals_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log1p_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_mish_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_relu6_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_pow_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_rad2deg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_randn_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_square_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_stft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tensor_split_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tril_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_angle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_asin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagflat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diff_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_floor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_det_singular_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_householder_product_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_inv_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_ldl_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_multi_dot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_pinv_singular_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lu_unpack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_amax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_min_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mvlgamma_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nan_to_num_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_embedding_bag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hardsigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hardswish_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_kl_div_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_linear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_local_response_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_softsign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polar_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_resize_as__cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_gaussian_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_hermite_polynomial_he_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_take_along_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tan_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_triu_indices_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unflatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unsafe_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_errors_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linalg_lstsq_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_errors_neg_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_conv1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_conv3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_errors_polar_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_hann_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_laguerre_polynomial_l_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_errors_take_cuda, test/test_ops.py::TestCommonCUDA::test_errors_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmod___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rsub___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rxor___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_add_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gather_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gcd_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_unary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vander_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_softmin_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_std_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_circular_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_replicate_negative_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rad2deg_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sgn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_square_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_all_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_amax_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cummax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_hfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isneginf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_4inputs_with_extra_args_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_binary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_resize_as__cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_scalar_tensor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sinh_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_slice_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_airy_ai_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_scaled_modified_bessel_k1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sum_to_size_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tril_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unflatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__chunk_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__chunk_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_angle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argwhere_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_block_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_combinations_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftshift_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geqrf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_unary_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_slogdet_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log10_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mT_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mT_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pairwise_distance_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pinverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_legendre_polynomial_p_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_split_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vander_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vecdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose2d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose3d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_gelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_rms_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_hann_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_nuttall_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tile_cuda_float64, test/test_ops.py::TestCommonCUDA::test_out__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_copysign_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_log_softmax_with_dtype_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_integral_dtype__refs_sum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_out_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_vector_norm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_add_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log10_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nansum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_legendre_polynomial_p_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning_T_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rxor___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_char_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_half_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_acosh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_digamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_istft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log10_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log_softmax_with_dtype_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_mul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nan_to_num_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ne_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_pixel_unshuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_relu6_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_permute_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_multigammaln_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_take_along_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_var_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addbmm_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_all_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atleast_2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_byte_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cdist_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cfloat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clone_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_corrcoef_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cummin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagflat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagonal_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_erf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_exp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_full_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_histogram_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_i0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_put_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_cholesky_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_diagonal_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_householder_product_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_ldl_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lstsq_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_slogdet_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mT_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_matmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_empty_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_binary_cross_entropy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_embedding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_feature_alpha_dropout_without_train_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_fractional_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_multilabel_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_constant_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pixel_unshuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_normal_in_place_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pow_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resolve_neg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sparse_mm_reduce_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_squeeze_multiple_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_torch_ops_aten__flash_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tril_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unsqueeze_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_left_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igammac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_amax_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_bucketize_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_normal__in_place_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_group_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hinge_embedding_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hinge_embedding_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igammac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lcm_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_number_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argwhere_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_4inputs_with_extra_args_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_rank_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mT_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_var_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_matmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_linear_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_circular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_reflect_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_negative_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randn_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsqrt_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_short_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapezoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vsplit_cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_frexp_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_inv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softmin_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_resolve_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atleast_3d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_empty_permuted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_heaviside_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_igamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_inv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_bicubic_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rad2deg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_randn_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_all_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_heaviside_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isneginf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_kron_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_inv_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mT_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multilabel_margin_loss_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_topk_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsafe_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_permuted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_eq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_heaviside_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isinf_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isposinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isreal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_kron_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_vecdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_dropout2d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ones_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rsub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_select_scatter_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diag_embed_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_permuted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isneginf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isposinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_inv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_matmul_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_replicate_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rand_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_where_cuda_float32, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view___rdiv___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_count_nonzero_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_dot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_equal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_expand_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_eye_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_fftshift_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_flipud_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_pow_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_take_along_dim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tanh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_vstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_acosh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addcmul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_angle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_as_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cdouble_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_clone_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cumprod_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_dot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cross_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_det_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_det_singular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_householder_product_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_var_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_movedim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nansum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv_transpose3d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_linear_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_outer_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_permute_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_real_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resize__cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resolve_neg_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sgn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sinc_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_list_args_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_with_sizes_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_square_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tanh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unbind_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_var_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_zeros_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_double_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_int_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_any_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cumprod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cumsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_equal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_hsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isreal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_vecdot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_normal__in_place_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_permute_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_rot90_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sinh_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_squeeze_multiple_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_std_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tril_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_where_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addbmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_all_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_chalf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cholesky_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cholesky_inverse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diagflat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diff_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_gather_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isnan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_eigvals_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_vecdot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log10_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_cumsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_fill_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_matmul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_reflect_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_fro_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_short_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_mean_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_trapezoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_trapz_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tril_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_triu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_where_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_short_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_asin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ceil_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clamp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_dsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erfinv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ifft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_floor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_floor_divide_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_hstack_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isnan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isneginf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log1p_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_maximum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_narrow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_renorm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_roll_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rot90_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_entr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_vdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__segment_reduce_offsets_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__softmax_backward_data_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_aminmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_block_diag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bucketize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cdist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cholesky_solve_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_constant_pad_nd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_contiguous_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diag_embed_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_double_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_equal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_erf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_float_power_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fmod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_gradient_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isposinf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_kron_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cholesky_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_det_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_det_singular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_triangular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_svdvals_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_tensorsolve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nanmedian_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_max_pool1d_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_avg_pool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_glu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_silu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_unfold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_norm_inf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_put_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_remainder_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_repeat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_reshape_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_round_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_softmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sparse_mm_reduce_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_i1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_split_with_sizes_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_square_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_squeeze_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_svd_lowrank_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_tan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_transpose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trunc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_var_mean_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake_H_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argsort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_allclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_and_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_or_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bool_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_byte_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_count_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_equal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifft_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ge_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isposinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_det_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_and_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose1d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rand_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_i0e_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_xlogy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_baddbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_or_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_copysign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fftshift_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_flipud_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logaddexp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_grid_sample_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_pca_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___getitem___cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_fftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_gradient_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_ldexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_inv_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool1d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_outer_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_entr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_eq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_fft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_fftshift_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isnan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_item_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_kron_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eigh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_logsigmoid_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nonzero_static_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resize__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_y1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_entr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_u_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_special_zeta_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_xlogy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_deg2rad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_equal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expand_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ifftn_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_heaviside_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isnan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isreal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanmean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_avg_pool3d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_glu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_outer_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_j1_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_y0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tensordot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_transpose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int8, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int8, test/test_ops.py::TestTagsCUDA::test_tags__batch_norm_with_update_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_char_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_half_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_addr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_or_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_cos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cumprod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_dot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_empty_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_eye_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_fft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isfinite_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isneginf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ne_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_celu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_poisson_nll_loss_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_permute_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_randn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_reshape_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sgn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_signbit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_i1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_logit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_zeta_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unbind_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unfold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_view_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_abs_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_allclose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_aminmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_baddbmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_byte_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_conj_physical_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cumsum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_div_floor_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_empty_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_fft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_floor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_geometric_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_gradient_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isfinite_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_diagonal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_eig_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_inv_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lstsq_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_svd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_long_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_elu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_1_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_repeat_interleave_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resize_as__cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_short_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sinc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_airy_ai_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_split_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tensordot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_trace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_triu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_true_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unique_consecutive_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_var_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_as_real_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags_view_copy_cuda_float32 2024-04-03T12:44:06.0747970Z 2024-04-03T12:44:08.3166035Z 2024-04-03T12:44:08.3167187Z real 60m44.115s 2024-04-03T12:44:08.3168378Z user 214m56.548s 2024-04-03T12:44:08.3171000Z sys 50m56.581s 2024-04-03T12:44:08.3173372Z + assert_git_not_dirty 2024-04-03T12:44:08.3175610Z + [[ linux-focal-rocm6.0-py3.8 != *rocm* ]] 2024-04-03T12:44:08.3177374Z + test_aten 2024-04-03T12:44:08.3179431Z + echo 'Running ATen tests with pytorch lib' 2024-04-03T12:44:08.3182666Z Running ATen tests with pytorch lib 2024-04-03T12:44:08.3184690Z + [[ -n '' ]] 2024-04-03T12:44:08.3186085Z + echo 'Running test with the build folder' 2024-04-03T12:44:08.3187685Z Running test with the build folder 2024-04-03T12:44:08.3189093Z + TEST_BASE_DIR=build/bin 2024-04-03T12:44:08.3193053Z + ln -sf /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libc10.so /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libc10_hip.so build/bin 
2024-04-03T12:44:08.3198830Z + ln -sf /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libcaffe2_nvrtc.so build/bin 2024-04-03T12:44:08.3217428Z + ln -sf '/opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libmkldnn*' build/bin 2024-04-03T12:44:08.3234546Z + ln -sf '/opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libnccl*' build/bin 2024-04-03T12:44:08.3268480Z + ln -sf /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libtorch.so /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libtorch_global_deps.so /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libtorch_hip.so /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libtorch_python.so /opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libtorchbind_test.so build/bin 2024-04-03T12:44:08.3285114Z + ln -sf '/opt/conda/envs/py_3.8/lib/python3.8/site-packages/torch/lib/libtbb*' build/bin 2024-04-03T12:44:08.3302339Z + ls build/bin 2024-04-03T12:44:08.3336833Z CMakeFiles 2024-04-03T12:44:08.3338175Z CTestTestfile.cmake 2024-04-03T12:44:08.3339852Z CppSignature_test 2024-04-03T12:44:08.3341047Z Dict_test 2024-04-03T12:44:08.3342259Z Dimname_test 2024-04-03T12:44:08.3343415Z FileStoreTest 2024-04-03T12:44:08.3344545Z HashStoreTest 2024-04-03T12:44:08.3345635Z IListRef_test 2024-04-03T12:44:08.3346778Z KernelFunction_test 2024-04-03T12:44:08.3348111Z List_test 2024-04-03T12:44:08.3349170Z MaybeOwned_test 2024-04-03T12:44:08.3350461Z NamedTensor_test 2024-04-03T12:44:08.3351647Z ProcessGroupGlooTest 2024-04-03T12:44:08.3352875Z StorageUtils_test 2024-04-03T12:44:08.3354217Z TCPStoreTest 2024-04-03T12:44:08.3355378Z aot_model_compiler_test 2024-04-03T12:44:08.3356624Z apply_utils_test 2024-04-03T12:44:08.3357711Z atest 2024-04-03T12:44:08.3359533Z backend_fallback_test 2024-04-03T12:44:08.3361649Z basic 2024-04-03T12:44:08.3363624Z broadcast_test 
2024-04-03T12:44:08.3364749Z c10_Bitset_test 2024-04-03T12:44:08.3365985Z c10_CompileTimeFunctionPointer_test 2024-04-03T12:44:08.3367477Z c10_ConstexprCrc_test 2024-04-03T12:44:08.3369345Z c10_DeadlockDetection_test 2024-04-03T12:44:08.3370876Z c10_DeviceGuard_test 2024-04-03T12:44:08.3372025Z c10_Device_test 2024-04-03T12:44:08.3373139Z c10_DispatchKeySet_test 2024-04-03T12:44:08.3374353Z c10_Half_test 2024-04-03T12:44:08.3375593Z c10_InlineDeviceGuard_test 2024-04-03T12:44:08.3376918Z c10_InlineStreamGuard_test 2024-04-03T12:44:08.3378190Z c10_LeftRight_test 2024-04-03T12:44:08.3379355Z c10_Metaprogramming_test 2024-04-03T12:44:08.3381037Z c10_Scalar_test 2024-04-03T12:44:08.3382170Z c10_SizesAndStrides_test 2024-04-03T12:44:08.3383418Z c10_StreamGuard_test 2024-04-03T12:44:08.3384584Z c10_SymInt_test 2024-04-03T12:44:08.3385684Z c10_Synchronized_test 2024-04-03T12:44:08.3387141Z c10_ThreadLocal_test 2024-04-03T12:44:08.3388321Z c10_TypeIndex_test 2024-04-03T12:44:08.3389467Z c10_TypeList_test 2024-04-03T12:44:08.3390569Z c10_TypeTraits_test 2024-04-03T12:44:08.3391457Z c10_accumulate_test 2024-04-03T12:44:08.3392277Z c10_bfloat16_test 2024-04-03T12:44:08.3393066Z c10_bit_cast_test 2024-04-03T12:44:08.3393878Z c10_complex_math_test 2024-04-03T12:44:08.3394727Z c10_complex_test 2024-04-03T12:44:08.3395530Z c10_cow_test 2024-04-03T12:44:08.3396465Z c10_exception_test 2024-04-03T12:44:08.3397265Z c10_flags_test 2024-04-03T12:44:08.3398034Z c10_generic_math_test 2024-04-03T12:44:08.3398943Z c10_hip_HIPAssertionsTest_1_var_test 2024-04-03T12:44:08.3400240Z c10_hip_HIPAssertionsTest_catches_stream 2024-04-03T12:44:08.3401776Z c10_hip_HIPAssertionsTest_catches_thread_and_block_and_device 2024-04-03T12:44:08.3403183Z c10_hip_HIPAssertionsTest_from_2_processes 2024-04-03T12:44:08.3404606Z c10_hip_HIPAssertionsTest_multiple_writes_from_blocks_and_threads 2024-04-03T12:44:08.3406257Z c10_hip_HIPAssertionsTest_multiple_writes_from_multiple_blocks 
2024-04-03T12:44:08.3407873Z c10_hip_HIPAssertionsTest_multiple_writes_from_same_block 2024-04-03T12:44:08.3409117Z c10_hip_HIPTest 2024-04-03T12:44:08.3410001Z c10_intrusive_ptr_benchmark 2024-04-03T12:44:08.3410934Z c10_intrusive_ptr_test 2024-04-03T12:44:08.3411771Z c10_irange_test 2024-04-03T12:44:08.3412538Z c10_logging_test 2024-04-03T12:44:08.3413316Z c10_optional_test 2024-04-03T12:44:08.3414182Z c10_ordered_preserving_dict_test 2024-04-03T12:44:08.3415160Z c10_registry_test 2024-04-03T12:44:08.3415938Z c10_small_vector_test 2024-04-03T12:44:08.3416765Z c10_ssize_test 2024-04-03T12:44:08.3417534Z c10_string_util_test 2024-04-03T12:44:08.3418375Z c10_string_view_test 2024-04-03T12:44:08.3419191Z c10_tempfile_test 2024-04-03T12:44:08.3420086Z c10_typeid_test 2024-04-03T12:44:08.3420881Z cmake_install.cmake 2024-04-03T12:44:08.3421708Z cpu_allocator_test 2024-04-03T12:44:08.3422527Z cpu_generator_test 2024-04-03T12:44:08.3423380Z cpu_profiling_allocator_test 2024-04-03T12:44:08.3424316Z cpu_rng_test 2024-04-03T12:44:08.3425081Z dispatch_key_set_test 2024-04-03T12:44:08.3426178Z dlconvertor_test 2024-04-03T12:44:08.3427233Z example_allreduce 2024-04-03T12:44:08.3428059Z extension_backend_test 2024-04-03T12:44:08.3428951Z half_test 2024-04-03T12:44:08.3429787Z hip_apply_test 2024-04-03T12:44:08.3430581Z hip_complex_math_test 2024-04-03T12:44:08.3431420Z hip_complex_test 2024-04-03T12:44:08.3432224Z hip_distributions_test 2024-04-03T12:44:08.3433103Z hip_dlconvertor_test 2024-04-03T12:44:08.3433935Z hip_generator_test 2024-04-03T12:44:08.3434729Z hip_half_test 2024-04-03T12:44:08.3435515Z hip_integer_divider_test 2024-04-03T12:44:08.3436394Z hip_optional_test 2024-04-03T12:44:08.3437248Z hip_packedtensoraccessor_test 2024-04-03T12:44:08.3438193Z hip_vectorized_test 2024-04-03T12:44:08.3439020Z inline_container_test 2024-04-03T12:44:08.3439840Z ivalue_test 2024-04-03T12:44:08.3440619Z kernel_function_legacy_test 2024-04-03T12:44:08.3441532Z kernel_function_test 
2024-04-03T12:44:08.3442383Z kernel_lambda_legacy_test 2024-04-03T12:44:08.3443264Z kernel_lambda_test 2024-04-03T12:44:08.3444076Z kernel_stackbased_test 2024-04-03T12:44:08.3444927Z lazy_tensor_test 2024-04-03T12:44:08.3445879Z legacy_vmap_test 2024-04-03T12:44:08.3446736Z libc10.so 2024-04-03T12:44:08.3447456Z libc10_hip.so 2024-04-03T12:44:08.3448217Z libcaffe2_nvrtc.so 2024-04-03T12:44:08.3449153Z 'libmkldnn*' 2024-04-03T12:44:08.3449931Z 'libnccl*' 2024-04-03T12:44:08.3450684Z 'libtbb*' 2024-04-03T12:44:08.3451389Z libtorch.so 2024-04-03T12:44:08.3452125Z libtorch_cpu.so 2024-04-03T12:44:08.3452927Z libtorch_global_deps.so 2024-04-03T12:44:08.3453803Z libtorch_hip.so 2024-04-03T12:44:08.3454602Z libtorch_python.so 2024-04-03T12:44:08.3455435Z libtorchbind_test.so 2024-04-03T12:44:08.3456330Z make_boxed_from_unboxed_functor_test 2024-04-03T12:44:08.3457341Z math_kernel_test 2024-04-03T12:44:08.3458273Z memory_format_test 2024-04-03T12:44:08.3459118Z memory_overlapping_test 2024-04-03T12:44:08.3460125Z mobile_memory_cleanup 2024-04-03T12:44:08.3460971Z native_test 2024-04-03T12:44:08.3461712Z op_allowlist_test 2024-04-03T12:44:08.3462542Z op_registration_test 2024-04-03T12:44:08.3463390Z operator_name_test 2024-04-03T12:44:08.3464208Z operators_test 2024-04-03T12:44:08.3465023Z packedtensoraccessor_test 2024-04-03T12:44:08.3465946Z parallel_benchmark 2024-04-03T12:44:08.3466906Z pow_test 2024-04-03T12:44:08.3467616Z protoc 2024-04-03T12:44:08.3468440Z protoc-3.13.0.0 2024-04-03T12:44:08.3469217Z quantized_test 2024-04-03T12:44:08.3469989Z reduce_ops_test 2024-04-03T12:44:08.3470814Z reportMemoryUsage_test 2024-04-03T12:44:08.3471678Z scalar_tensor_test 2024-04-03T12:44:08.3472467Z scalar_test 2024-04-03T12:44:08.3473221Z static_runtime_bench 2024-04-03T12:44:08.3474060Z static_runtime_test 2024-04-03T12:44:08.3474893Z stride_properties_test 2024-04-03T12:44:08.3475761Z tensor_iterator_test 2024-04-03T12:44:08.3476575Z test_api 2024-04-03T12:44:08.3477294Z 
test_cpp_rpc 2024-04-03T12:44:08.3478055Z test_dist_autograd 2024-04-03T12:44:08.3478882Z test_edge_op_registration 2024-04-03T12:44:08.3479776Z test_jit 2024-04-03T12:44:08.3480502Z test_lazy 2024-04-03T12:44:08.3481246Z test_mobile_nnc 2024-04-03T12:44:08.3482048Z test_parallel 2024-04-03T12:44:08.3482816Z test_tensorexpr 2024-04-03T12:44:08.3483601Z thread_init_test 2024-04-03T12:44:08.3484396Z torch_shm_manager 2024-04-03T12:44:08.3485215Z tutorial_tensorexpr 2024-04-03T12:44:08.3486054Z type_ptr_test 2024-04-03T12:44:08.3486812Z type_test 2024-04-03T12:44:08.3487557Z undefined_tensor_test 2024-04-03T12:44:08.3488418Z vec_test_all_types_AVX2 2024-04-03T12:44:08.3489319Z vec_test_all_types_AVX512 2024-04-03T12:44:08.3490225Z vec_test_all_types_DEFAULT 2024-04-03T12:44:08.3491126Z verify_api_visibility 2024-04-03T12:44:08.3491974Z weakref_test 2024-04-03T12:44:08.3492727Z wrapdim_test 2024-04-03T12:44:08.3493508Z xla_tensor_test 2024-04-03T12:44:08.3494348Z + aten/tools/run_tests.sh build/bin 2024-04-03T12:44:08.3495382Z + set -e 2024-04-03T12:44:08.3496158Z ++ dirname aten/tools/run_tests.sh 2024-04-03T12:44:08.3497474Z + VALGRIND_SUP=/var/lib/jenkins/pytorch/aten/tools/valgrind.sup 2024-04-03T12:44:08.3498842Z + export CPP_TESTS_DIR=build/bin 2024-04-03T12:44:08.3500084Z + CPP_TESTS_DIR=build/bin 2024-04-03T12:44:08.3501004Z + VALGRIND=OFF 2024-04-03T12:44:08.3507119Z + python test/run_test.py --cpp --verbose -i cpp/basic cpp/atest cpp/scalar_test cpp/broadcast_test cpp/wrapdim_test cpp/apply_utils_test cpp/dlconvertor_test cpp/native_test cpp/scalar_tensor_test cpp/undefined_tensor_test cpp/extension_backend_test cpp/lazy_tensor_test cpp/tensor_iterator_test cpp/Dimname_test cpp/Dict_test cpp/NamedTensor_test cpp/cpu_generator_test cpp/legacy_vmap_test cpp/operators_test 2024-04-03T12:44:08.4492166Z test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. 
See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-04-03T12:44:08.4494901Z import pkg_resources 2024-04-03T12:44:11.5906888Z Downloading https://ossci-metrics.s3.amazonaws.com/slow-tests.json to /var/lib/jenkins/pytorch/test/.pytorch-slow-tests.json 2024-04-03T12:44:11.5912947Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/pytorch/test/.pytorch-disabled-tests.json 2024-04-03T12:44:11.5985085Z Found test times from artifacts 2024-04-03T12:44:11.6287265Z Found test times from artifacts 2024-04-03T12:44:11.6297741Z Running all tests 2024-04-03T12:44:11.6302176Z Name: tests to run (est. time: 0.0min) 2024-04-03T12:44:11.6304300Z Serial tests (0): 2024-04-03T12:44:11.6305759Z Parallel tests (19): 2024-04-03T12:44:11.6307182Z cpp/Dict_test 1/1 2024-04-03T12:44:11.6308382Z cpp/Dimname_test 1/1 2024-04-03T12:44:11.6309681Z cpp/NamedTensor_test 1/1 2024-04-03T12:44:11.6311003Z cpp/apply_utils_test 1/1 2024-04-03T12:44:11.6312290Z cpp/atest 1/1 2024-04-03T12:44:11.6313937Z cpp/basic 1/1 2024-04-03T12:44:11.6315137Z cpp/broadcast_test 1/1 2024-04-03T12:44:11.6316444Z cpp/cpu_generator_test 1/1 2024-04-03T12:44:11.6317794Z cpp/dlconvertor_test 1/1 2024-04-03T12:44:11.6319128Z cpp/extension_backend_test 1/1 2024-04-03T12:44:11.6320597Z cpp/lazy_tensor_test 1/1 2024-04-03T12:44:11.6321914Z cpp/legacy_vmap_test 1/1 2024-04-03T12:44:11.6323185Z cpp/native_test 1/1 2024-04-03T12:44:11.6324395Z cpp/operators_test 1/1 2024-04-03T12:44:11.6325891Z cpp/scalar_tensor_test 1/1 2024-04-03T12:44:11.6327194Z cpp/scalar_test 1/1 2024-04-03T12:44:11.6328439Z cpp/tensor_iterator_test 1/1 2024-04-03T12:44:11.6329825Z cpp/undefined_tensor_test 1/1 2024-04-03T12:44:11.6331189Z cpp/wrapdim_test 1/1 2024-04-03T12:44:11.6332456Z Name: excluded (est. 
time: 0.0min) 2024-04-03T12:44:11.6333809Z Serial tests (0): 2024-04-03T12:44:11.6334949Z Parallel tests (0): 2024-04-03T12:44:11.6337034Z Starting test batch 'tests to run' 0.0 seconds after initiating testing 2024-04-03T12:44:13.9725338Z Running cpp/Dict_test 1/1 ... [2024-04-03 12:44:13.971545] 2024-04-03T12:44:13.9734313Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/Dict_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-61f0a9f057e0e3c0.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:13.972614] 2024-04-03T12:44:13.9860391Z Running cpp/Dimname_test 1/1 ... [2024-04-03 12:44:13.985126] 2024-04-03T12:44:13.9870811Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/Dimname_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-e69a01fa7d9a5412.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:13.986046] 2024-04-03T12:44:14.0847914Z Running cpp/NamedTensor_test 1/1 ... [2024-04-03 12:44:14.083772] 2024-04-03T12:44:14.0853976Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/NamedTensor_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-384a41d4db1ea629.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:14.084714] 2024-04-03T12:44:14.0951373Z Running cpp/apply_utils_test 1/1 ... [2024-04-03 12:44:14.094329] 2024-04-03T12:44:14.0969639Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/apply_utils_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-13493532c7d47654.xml', '-x', '--reruns=2'] ... 
[2024-04-03 12:44:14.095949] 2024-04-03T12:44:23.2773007Z 2024-04-03T12:44:23.2775923Z cpp/Dimname_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dimname_test_1.1_08c97e9aaefb0635_.log 2024-04-03T12:44:23.2778027Z 2024-04-03T12:44:23.2784826Z Running cpp/atest 1/1 ... [2024-04-03 12:44:23.278159] 2024-04-03T12:44:23.2797418Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/atest', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-c458be4a8031c92d.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:23.279188] 2024-04-03T12:44:25.1643268Z 2024-04-03T12:44:25.1647569Z cpp/apply_utils_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.apply_utils_test_1.1_68d9c458701743b3_.log 2024-04-03T12:44:25.1650981Z 2024-04-03T12:44:25.1651921Z Running cpp/basic 1/1 ... [2024-04-03 12:44:25.164719] 2024-04-03T12:44:25.1664307Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/basic', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-f7e01420c8f418e6.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:25.165677] 2024-04-03T12:44:26.8795192Z 2024-04-03T12:44:26.8799194Z cpp/NamedTensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.NamedTensor_test_1.1_a8dfdbdfb46be735_.log 2024-04-03T12:44:26.8802188Z 2024-04-03T12:44:26.8807442Z Running cpp/broadcast_test 1/1 ... [2024-04-03 12:44:26.880397] 2024-04-03T12:44:26.8822818Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/broadcast_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-84cfaae63398e9d4.xml', '-x', '--reruns=2'] ... 
[2024-04-03 12:44:26.881449] 2024-04-03T12:44:34.8732373Z 2024-04-03T12:44:34.8734459Z cpp/broadcast_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.broadcast_test_1.1_a1b53734d2985b07_.log 2024-04-03T12:44:34.8736682Z 2024-04-03T12:44:34.8738805Z Running cpp/cpu_generator_test 1/1 ... [2024-04-03 12:44:34.873269] 2024-04-03T12:44:34.8754863Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/cpu_generator_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-62a2ef0d44e26f89.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:34.874511] 2024-04-03T12:44:35.0978951Z 2024-04-03T12:44:35.0983138Z cpp/basic 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.basic_1.1_8ad405c68b3c9450_.log 2024-04-03T12:44:35.0986616Z 2024-04-03T12:44:35.0988359Z Running cpp/dlconvertor_test 1/1 ... [2024-04-03 12:44:35.097825] 2024-04-03T12:44:35.0997910Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/dlconvertor_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-7537d978005ed6fc.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:35.098932] 2024-04-03T12:44:37.5387725Z 2024-04-03T12:44:37.5389982Z cpp/atest 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.atest_1.1_b3ed596288111141_.log 2024-04-03T12:44:37.5392707Z 2024-04-03T12:44:37.5423511Z Running cpp/extension_backend_test 1/1 ... [2024-04-03 12:44:37.541566] 2024-04-03T12:44:37.5472918Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/extension_backend_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-82e4b953989ac41a.xml', '-x', '--reruns=2'] ... 
[2024-04-03 12:44:37.546357] 2024-04-03T12:44:43.0920579Z 2024-04-03T12:44:43.0925495Z cpp/dlconvertor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.dlconvertor_test_1.1_246a6a66396b482a_.log 2024-04-03T12:44:43.0928765Z 2024-04-03T12:44:43.0930046Z Running cpp/lazy_tensor_test 1/1 ... [2024-04-03 12:44:43.092006] 2024-04-03T12:44:43.0937829Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/lazy_tensor_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-ca27edcfa6e392df.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:43.093037] 2024-04-03T12:44:45.7687020Z 2024-04-03T12:44:45.7691508Z cpp/Dict_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dict_test_1.1_eea8d3f17ca7dd6e_.log 2024-04-03T12:44:45.7694630Z 2024-04-03T12:44:45.7696363Z Running cpp/legacy_vmap_test 1/1 ... [2024-04-03 12:44:45.769279] 2024-04-03T12:44:45.7728797Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/legacy_vmap_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-2cce9897dbde5277.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:45.770307] 2024-04-03T12:44:45.8727141Z 2024-04-03T12:44:45.8731003Z cpp/extension_backend_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.extension_backend_test_1.1_71867304633d616d_.log 2024-04-03T12:44:45.8733551Z 2024-04-03T12:44:45.8734452Z Running cpp/native_test 1/1 ... [2024-04-03 12:44:45.872864] 2024-04-03T12:44:45.8745620Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/native_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-0fdf9ecc7550dd25.xml', '-x', '--reruns=2'] ... 
[2024-04-03 12:44:45.873818] 2024-04-03T12:44:48.5475177Z 2024-04-03T12:44:48.5489512Z cpp/cpu_generator_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cpu_generator_test_1.1_740415e53527f4fe_.log 2024-04-03T12:44:48.5492895Z 2024-04-03T12:44:48.5494315Z Running cpp/operators_test 1/1 ... [2024-04-03 12:44:48.547387] 2024-04-03T12:44:48.5500536Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/operators_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-a65687febe18fc21.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:48.548376] 2024-04-03T12:44:50.5062236Z 2024-04-03T12:44:50.5067913Z cpp/lazy_tensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.lazy_tensor_test_1.1_c65241e3a4acb408_.log 2024-04-03T12:44:50.5071333Z 2024-04-03T12:44:50.5084456Z Running cpp/scalar_tensor_test 1/1 ... [2024-04-03 12:44:50.507819] 2024-04-03T12:44:50.5093337Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/scalar_tensor_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-9ff7dc171fc9cee6.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:50.508708] 2024-04-03T12:44:55.1479528Z 2024-04-03T12:44:55.1483161Z cpp/native_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.native_test_1.1_ca23ed2d0b9a9804_.log 2024-04-03T12:44:55.1486620Z 2024-04-03T12:44:55.1487913Z Running cpp/scalar_test 1/1 ... [2024-04-03 12:44:55.147693] 2024-04-03T12:44:55.1499972Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/scalar_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-29e21f73a6aae280.xml', '-x', '--reruns=2'] ... 
[2024-04-03 12:44:55.148789] 2024-04-03T12:44:56.2965825Z 2024-04-03T12:44:56.2969932Z cpp/operators_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.operators_test_1.1_3fca26950c6290e2_.log 2024-04-03T12:44:56.2973243Z 2024-04-03T12:44:56.2974601Z Running cpp/tensor_iterator_test 1/1 ... [2024-04-03 12:44:56.296187] 2024-04-03T12:44:56.2981643Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/tensor_iterator_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-f2c3ef0b14fa72b5.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:56.297172] 2024-04-03T12:44:58.8100362Z 2024-04-03T12:44:58.8104151Z cpp/scalar_tensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.scalar_tensor_test_1.1_04259dce2fa8a4a0_.log 2024-04-03T12:44:58.8107138Z 2024-04-03T12:44:58.8108291Z Running cpp/undefined_tensor_test 1/1 ... [2024-04-03 12:44:58.809778] 2024-04-03T12:44:58.8116684Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/undefined_tensor_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-381b0da75a322c21.xml', '-x', '--reruns=2'] ... [2024-04-03 12:44:58.810578] 2024-04-03T12:45:02.9576146Z 2024-04-03T12:45:02.9579871Z cpp/scalar_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.scalar_test_1.1_a524f8e38d588a13_.log 2024-04-03T12:45:02.9587029Z 2024-04-03T12:45:02.9629044Z Running cpp/wrapdim_test 1/1 ... [2024-04-03 12:45:02.961682] 2024-04-03T12:45:02.9633982Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/wrapdim_test', '-v', '-vv', '-rfEX', '-n', '4', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-26b86f669559b9b6.xml', '-x', '--reruns=2'] ... 
[2024-04-03 12:45:02.962768] 2024-04-03T12:45:04.6811161Z 2024-04-03T12:45:04.6813999Z cpp/legacy_vmap_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.legacy_vmap_test_1.1_09ca941fda98da41_.log 2024-04-03T12:45:04.6816118Z 2024-04-03T12:45:06.3056277Z 2024-04-03T12:45:06.3059105Z cpp/undefined_tensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.undefined_tensor_test_1.1_a70bc1eb9d45214c_.log 2024-04-03T12:45:06.3060653Z 2024-04-03T12:45:09.8576993Z 2024-04-03T12:45:09.8580187Z cpp/wrapdim_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.wrapdim_test_1.1_ed33bb88894fd28b_.log 2024-04-03T12:45:09.8583209Z 2024-04-03T12:45:30.3041096Z 2024-04-03T12:45:30.3044732Z cpp/tensor_iterator_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.tensor_iterator_test_1.1_8b242e916db0b27c_.log 2024-04-03T12:45:30.3048479Z 2024-04-03T12:45:33.4052832Z + [[ -x ./tensor_interop_test ]] 2024-04-03T12:45:33.4054307Z + [[ -x ./cudnn_test ]] 2024-04-03T12:45:33.4055950Z + [[ -x ./cuda_generator_test ]] 2024-04-03T12:45:33.4057297Z + [[ -x ./apply_test ]] 2024-04-03T12:45:33.4058486Z + [[ -x ./stream_test ]] 2024-04-03T12:45:33.4059945Z + [[ -x ./cuda_half_test ]] 2024-04-03T12:45:33.4061384Z + [[ -x ./cuda_vectorized_test ]] 2024-04-03T12:45:33.4062811Z + [[ -x ./cuda_distributions_test ]] 2024-04-03T12:45:33.4064293Z + [[ -x ./cuda_optional_test ]] 2024-04-03T12:45:33.4065772Z + [[ -x ./cuda_tensor_interop_test ]] 2024-04-03T12:45:33.4067328Z + [[ -x ./cuda_complex_test ]] 2024-04-03T12:45:33.4069128Z + [[ -x ./cuda_complex_math_test ]] 2024-04-03T12:45:33.4070526Z + [[ -x ./cuda_cub_test ]] 2024-04-03T12:45:33.4071814Z + [[ -x ./cuda_atomic_ops_test ]] 2024-04-03T12:45:33.4073113Z + '[' OFF == ON ']' 2024-04-03T12:45:33.4074172Z + [[ -n '' ]] 2024-04-03T12:45:33.4075161Z + assert_git_not_dirty 
2024-04-03T12:45:33.4076525Z + [[ linux-focal-rocm6.0-py3.8 != *rocm* ]] 2024-04-03T12:45:33.4162259Z ##[debug]CONTAINER_NAME='48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176' 2024-04-03T12:45:33.4167861Z ##[debug]Finishing: Test 2024-04-03T12:45:33.4191849Z ##[debug]Evaluating condition for step: 'Save test results' 2024-04-03T12:45:33.4195900Z ##[debug]Evaluating: always() 2024-04-03T12:45:33.4197130Z ##[debug]Evaluating always: 2024-04-03T12:45:33.4198381Z ##[debug]=> true 2024-04-03T12:45:33.4199668Z ##[debug]Result: true 2024-04-03T12:45:33.4201398Z ##[debug]Starting: Save test results 2024-04-03T12:45:33.4223401Z ##[debug]Loading inputs 2024-04-03T12:45:33.4227475Z ##[debug]Evaluating: format('# copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2024-04-03T12:45:33.4230253Z ##[debug]docker exec -t "{0}" sh -c "cd ../pytorch && sudo cp -R test/test-reports ../workspace/test" 2024-04-03T12:45:33.4231963Z ##[debug]', env.CONTAINER_NAME) 2024-04-03T12:45:33.4232931Z ##[debug]Evaluating format: 2024-04-03T12:45:33.4233873Z ##[debug]..Evaluating String: 2024-04-03T12:45:33.4235741Z ##[debug]..=> '# copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2024-04-03T12:45:33.4238285Z ##[debug]docker exec -t "{0}" sh -c "cd ../pytorch && sudo cp -R test/test-reports ../workspace/test" 2024-04-03T12:45:33.4239916Z ##[debug]' 2024-04-03T12:45:33.4240880Z ##[debug]..Evaluating Index: 2024-04-03T12:45:33.4241797Z ##[debug]....Evaluating env: 2024-04-03T12:45:33.4242700Z ##[debug]....=> Object 2024-04-03T12:45:33.4243582Z ##[debug]....Evaluating String: 2024-04-03T12:45:33.4244554Z ##[debug]....=> 'CONTAINER_NAME' 2024-04-03T12:45:33.4250630Z ##[debug]..=> '48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176' 2024-04-03T12:45:33.4253332Z ##[debug]=> '# copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 
2024-04-03T12:45:33.4256570Z ##[debug]docker exec -t "48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176" sh -c "cd ../pytorch && sudo cp -R test/test-reports ../workspace/test" 2024-04-03T12:45:33.4258930Z ##[debug]' 2024-04-03T12:45:33.4260946Z ##[debug]Result: '# copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2024-04-03T12:45:33.4264500Z ##[debug]docker exec -t "48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176" sh -c "cd ../pytorch && sudo cp -R test/test-reports ../workspace/test" 2024-04-03T12:45:33.4266890Z ##[debug]' 2024-04-03T12:45:33.4267884Z ##[debug]Loading env 2024-04-03T12:45:33.4278541Z ##[group]Run # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2024-04-03T12:45:33.4280311Z # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2024-04-03T12:45:33.4282405Z docker exec -t "48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176" sh -c "cd ../pytorch && sudo cp -R test/test-reports ../workspace/test" 2024-04-03T12:45:33.4320491Z shell: /usr/bin/bash -e {0} 2024-04-03T12:45:33.4321002Z env: 2024-04-03T12:45:33.4321385Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:33.4321923Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:33.4322941Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:45:33.4323897Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:33.4324427Z AWS_REGION: us-east-1 2024-04-03T12:45:33.4325096Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:45:33.4325748Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:45:33.4334554Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:33.4335284Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:33.4336113Z ##[endgroup] 2024-04-03T12:45:33.4368685Z ##[debug]/usr/bin/bash -e 
/home/pytorchci/actions-runner/_work/_temp/2e20787d-b5ed-4a12-86f1-fecd23020799.sh 2024-04-03T12:45:33.5325004Z sudo: setrlimit(RLIMIT_STACK): Operation not permitted 2024-04-03T12:45:33.5761438Z ##[debug]Finishing: Save test results 2024-04-03T12:45:33.5786335Z ##[debug]Evaluating condition for step: 'Print remaining test logs' 2024-04-03T12:45:33.5791119Z ##[debug]Evaluating: (always() && steps.test.conclusion) 2024-04-03T12:45:33.5793290Z ##[debug]Evaluating And: 2024-04-03T12:45:33.5794847Z ##[debug]..Evaluating always: 2024-04-03T12:45:33.5796066Z ##[debug]..=> true 2024-04-03T12:45:33.5797106Z ##[debug]..Evaluating Index: 2024-04-03T12:45:33.5798264Z ##[debug]....Evaluating Index: 2024-04-03T12:45:33.5799435Z ##[debug]......Evaluating steps: 2024-04-03T12:45:33.5800618Z ##[debug]......=> Object 2024-04-03T12:45:33.5801748Z ##[debug]......Evaluating String: 2024-04-03T12:45:33.5802909Z ##[debug]......=> 'test' 2024-04-03T12:45:33.5803986Z ##[debug]....=> Object 2024-04-03T12:45:33.5805071Z ##[debug]....Evaluating String: 2024-04-03T12:45:33.5806246Z ##[debug]....=> 'conclusion' 2024-04-03T12:45:33.5807359Z ##[debug]..=> 'success' 2024-04-03T12:45:33.5808414Z ##[debug]=> 'success' 2024-04-03T12:45:33.5809835Z ##[debug]Expanded: (true && 'success') 2024-04-03T12:45:33.5811178Z ##[debug]Result: 'success' 2024-04-03T12:45:33.5812707Z ##[debug]Starting: Print remaining test logs 2024-04-03T12:45:33.5831652Z ##[debug]Loading inputs 2024-04-03T12:45:33.5833638Z ##[debug]Loading env 2024-04-03T12:45:33.5845464Z ##[group]Run cat test/**/*_toprint.log || true 2024-04-03T12:45:33.5846581Z cat test/**/*_toprint.log || true 2024-04-03T12:45:33.5900181Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T12:45:33.5901340Z env: 2024-04-03T12:45:33.5901981Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:33.5902925Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:33.5904673Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri 
--group-add video --group-add daemon 2024-04-03T12:45:33.5906563Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:33.5907443Z AWS_REGION: us-east-1 2024-04-03T12:45:33.5908400Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:45:33.5909542Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:45:33.5920507Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:33.5921182Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:33.5921916Z ##[endgroup] 2024-04-03T12:45:33.5950644Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/15cbcc5f-f4f4-4053-9a07-ae6046a1a0b0.sh 2024-04-03T12:45:33.6123745Z cat: 'test/**/*_toprint.log': No such file or directory 2024-04-03T12:45:33.6136491Z ##[debug]Finishing: Print remaining test logs 2024-04-03T12:45:33.6161903Z ##[debug]Evaluating: steps.monitor-script.outputs.monitor-script-pid 2024-04-03T12:45:33.6163947Z ##[debug]Evaluating Index: 2024-04-03T12:45:33.6165168Z ##[debug]..Evaluating Index: 2024-04-03T12:45:33.6166409Z ##[debug]....Evaluating Index: 2024-04-03T12:45:33.6167667Z ##[debug]......Evaluating steps: 2024-04-03T12:45:33.6169245Z ##[debug]......=> Object 2024-04-03T12:45:33.6170434Z ##[debug]......Evaluating String: 2024-04-03T12:45:33.6171751Z ##[debug]......=> 'monitor-script' 2024-04-03T12:45:33.6173102Z ##[debug]....=> Object 2024-04-03T12:45:33.6174229Z ##[debug]....Evaluating String: 2024-04-03T12:45:33.6175466Z ##[debug]....=> 'outputs' 2024-04-03T12:45:33.6176590Z ##[debug]..=> Object 2024-04-03T12:45:33.6177682Z ##[debug]..Evaluating String: 2024-04-03T12:45:33.6178970Z ##[debug]..=> 'monitor-script-pid' 2024-04-03T12:45:33.6180693Z ##[debug]=> '3036295' 2024-04-03T12:45:33.6181798Z ##[debug]Result: '3036295' 2024-04-03T12:45:33.6183619Z ##[debug]Evaluating condition for step: 'Stop monitoring script' 2024-04-03T12:45:33.6188581Z ##[debug]Evaluating: (always() && steps.monitor-script.outputs.monitor-script-pid) 2024-04-03T12:45:33.6191073Z 
##[debug]Evaluating And: 2024-04-03T12:45:33.6192597Z ##[debug]..Evaluating always: 2024-04-03T12:45:33.6194129Z ##[debug]..=> true 2024-04-03T12:45:33.6195574Z ##[debug]..Evaluating Index: 2024-04-03T12:45:33.6196908Z ##[debug]....Evaluating Index: 2024-04-03T12:45:33.6198071Z ##[debug]......Evaluating Index: 2024-04-03T12:45:33.6199262Z ##[debug]........Evaluating steps: 2024-04-03T12:45:33.6200446Z ##[debug]........=> Object 2024-04-03T12:45:33.6201575Z ##[debug]........Evaluating String: 2024-04-03T12:45:33.6202835Z ##[debug]........=> 'monitor-script' 2024-04-03T12:45:33.6204023Z ##[debug]......=> Object 2024-04-03T12:45:33.6205144Z ##[debug]......Evaluating String: 2024-04-03T12:45:33.6206305Z ##[debug]......=> 'outputs' 2024-04-03T12:45:33.6207366Z ##[debug]....=> Object 2024-04-03T12:45:33.6208443Z ##[debug]....Evaluating String: 2024-04-03T12:45:33.6209673Z ##[debug]....=> 'monitor-script-pid' 2024-04-03T12:45:33.6210877Z ##[debug]..=> '3036295' 2024-04-03T12:45:33.6211889Z ##[debug]=> '3036295' 2024-04-03T12:45:33.6213197Z ##[debug]Expanded: (true && '3036295') 2024-04-03T12:45:33.6214527Z ##[debug]Result: '3036295' 2024-04-03T12:45:33.6216002Z ##[debug]Starting: Stop monitoring script 2024-04-03T12:45:33.6235216Z ##[debug]Loading inputs 2024-04-03T12:45:33.6237140Z ##[debug]Loading env 2024-04-03T12:45:33.6248831Z ##[group]Run kill "$MONITOR_SCRIPT_PID" 2024-04-03T12:45:33.6249854Z kill "$MONITOR_SCRIPT_PID" 2024-04-03T12:45:33.6300458Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T12:45:33.6301585Z env: 2024-04-03T12:45:33.6302210Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:33.6303131Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:33.6304828Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:45:33.6306418Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:33.6306979Z AWS_REGION: us-east-1 2024-04-03T12:45:33.6307596Z AWS_ACCESS_KEY_ID: *** 
2024-04-03T12:45:33.6308298Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:45:33.6317703Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:33.6318370Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:33.6319132Z MONITOR_SCRIPT_PID: 3036295 2024-04-03T12:45:33.6319561Z ##[endgroup] 2024-04-03T12:45:33.6347566Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/bdd6ddfd-f950-4f3f-9891-37024abd5572.sh 2024-04-03T12:45:33.6387123Z ##[debug]Finishing: Stop monitoring script 2024-04-03T12:45:33.6405650Z ##[debug]Evaluating condition for step: 'Upload test artifacts' 2024-04-03T12:45:33.6410179Z ##[debug]Evaluating: (always() && steps.test.conclusion && (steps.test.conclusion != 'skipped')) 2024-04-03T12:45:33.6412153Z ##[debug]Evaluating And: 2024-04-03T12:45:33.6413324Z ##[debug]..Evaluating always: 2024-04-03T12:45:33.6414478Z ##[debug]..=> true 2024-04-03T12:45:33.6415604Z ##[debug]..Evaluating Index: 2024-04-03T12:45:33.6416802Z ##[debug]....Evaluating Index: 2024-04-03T12:45:33.6418048Z ##[debug]......Evaluating steps: 2024-04-03T12:45:33.6419265Z ##[debug]......=> Object 2024-04-03T12:45:33.6420577Z ##[debug]......Evaluating String: 2024-04-03T12:45:33.6421991Z ##[debug]......=> 'test' 2024-04-03T12:45:33.6423114Z ##[debug]....=> Object 2024-04-03T12:45:33.6424264Z ##[debug]....Evaluating String: 2024-04-03T12:45:33.6425474Z ##[debug]....=> 'conclusion' 2024-04-03T12:45:33.6426609Z ##[debug]..=> 'success' 2024-04-03T12:45:33.6427868Z ##[debug]..Evaluating NotEqual: 2024-04-03T12:45:33.6429165Z ##[debug]....Evaluating Index: 2024-04-03T12:45:33.6430370Z ##[debug]......Evaluating Index: 2024-04-03T12:45:33.6431624Z ##[debug]........Evaluating steps: 2024-04-03T12:45:33.6432830Z ##[debug]........=> Object 2024-04-03T12:45:33.6434032Z ##[debug]........Evaluating String: 2024-04-03T12:45:33.6435242Z ##[debug]........=> 'test' 2024-04-03T12:45:33.6436362Z ##[debug]......=> Object 
2024-04-03T12:45:33.6437529Z ##[debug]......Evaluating String: 2024-04-03T12:45:33.6438748Z ##[debug]......=> 'conclusion' 2024-04-03T12:45:33.6439918Z ##[debug]....=> 'success' 2024-04-03T12:45:33.6441077Z ##[debug]....Evaluating String: 2024-04-03T12:45:33.6442256Z ##[debug]....=> 'skipped' 2024-04-03T12:45:33.6443341Z ##[debug]..=> true 2024-04-03T12:45:33.6444333Z ##[debug]=> true 2024-04-03T12:45:33.6446134Z ##[debug]Expanded: (true && 'success' && ('success' != 'skipped')) 2024-04-03T12:45:33.6447771Z ##[debug]Result: true 2024-04-03T12:45:33.6449207Z ##[debug]Starting: Upload test artifacts 2024-04-03T12:45:33.6644237Z Prepare all required actions 2024-04-03T12:45:33.6645136Z Getting action download info 2024-04-03T12:45:34.3244503Z Download action repository 'seemethere/upload-artifact-s3@v5' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2024-04-03T12:45:35.4230898Z ##[debug]Download 'https://api.github.com/repos/seemethere/upload-artifact-s3/tarball/baba72d0712b404f646cebe0730933554ebce96a' to '/home/pytorchci/actions-runner/_work/_actions/_temp_169a76be-0c72-4cec-b836-2eab27421b45/6241d1dd-28d4-471e-ac5e-20558f62c605.tar.gz' 2024-04-03T12:45:35.4815286Z ##[debug]Unwrap 'seemethere-upload-artifact-s3-baba72d' to '/home/pytorchci/actions-runner/_work/_actions/seemethere/upload-artifact-s3/v5' 2024-04-03T12:45:35.4980765Z ##[debug]Archive '/home/pytorchci/actions-runner/_work/_actions/_temp_169a76be-0c72-4cec-b836-2eab27421b45/6241d1dd-28d4-471e-ac5e-20558f62c605.tar.gz' has been unzipped into '/home/pytorchci/actions-runner/_work/_actions/seemethere/upload-artifact-s3/v5'. 2024-04-03T12:45:35.5048779Z ##[debug]Action 'seemethere/upload-artifact-s3@v5' already downloaded at '/home/pytorchci/actions-runner/_work/_actions/seemethere/upload-artifact-s3/v5'. 2024-04-03T12:45:35.5054039Z ##[debug]Action 'seemethere/upload-artifact-s3@v5' already downloaded at '/home/pytorchci/actions-runner/_work/_actions/seemethere/upload-artifact-s3/v5'. 
2024-04-03T12:45:35.5059170Z ##[debug]Action 'actions/upload-artifact@v3' already downloaded at '/home/pytorchci/actions-runner/_work/_actions/actions/upload-artifact/v3'. 2024-04-03T12:45:35.5064297Z ##[debug]Action 'actions/upload-artifact@v3' already downloaded at '/home/pytorchci/actions-runner/_work/_actions/actions/upload-artifact/v3'. 2024-04-03T12:45:35.5069231Z ##[debug]Action 'actions/upload-artifact@v3' already downloaded at '/home/pytorchci/actions-runner/_work/_actions/actions/upload-artifact/v3'. 2024-04-03T12:45:35.5073039Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/seemethere/upload-artifact-s3/v5/action.yml'. 2024-04-03T12:45:35.5121832Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/seemethere/upload-artifact-s3/v5/action.yml'. 2024-04-03T12:45:35.5199510Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/seemethere/upload-artifact-s3/v5/action.yml'. 2024-04-03T12:45:35.5256062Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/actions/upload-artifact/v3/action.yml'. 2024-04-03T12:45:35.5291775Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/actions/upload-artifact/v3/action.yml'. 2024-04-03T12:45:35.5325440Z ##[debug]action.yml for action: '/home/pytorchci/actions-runner/_work/_actions/actions/upload-artifact/v3/action.yml'. 
2024-04-03T12:45:35.5560963Z ##[debug]Loading inputs 2024-04-03T12:45:35.5563743Z ##[debug]Evaluating: format('{0}-{1}-{2}-{3}-{4}_{5}', github.job, matrix.config, matrix.shard, matrix.num_shards, matrix.runner, steps.get-job-id.outputs.job-id) 2024-04-03T12:45:35.5565149Z ##[debug]Evaluating format: 2024-04-03T12:45:35.5565864Z ##[debug]..Evaluating String: 2024-04-03T12:45:35.5566498Z ##[debug]..=> '{0}-{1}-{2}-{3}-{4}_{5}' 2024-04-03T12:45:35.5567294Z ##[debug]..Evaluating Index: 2024-04-03T12:45:35.5567935Z ##[debug]....Evaluating github: 2024-04-03T12:45:35.5568542Z ##[debug]....=> Object 2024-04-03T12:45:35.5569131Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5569786Z ##[debug]....=> 'job' 2024-04-03T12:45:35.5570364Z ##[debug]..=> 'test' 2024-04-03T12:45:35.5570839Z ##[debug]..Evaluating Index: 2024-04-03T12:45:35.5571416Z ##[debug]....Evaluating matrix: 2024-04-03T12:45:35.5571932Z ##[debug]....=> Object 2024-04-03T12:45:35.5572432Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5573044Z ##[debug]....=> 'config' 2024-04-03T12:45:35.5573655Z ##[debug]..=> 'default' 2024-04-03T12:45:35.5574255Z ##[debug]..Evaluating Index: 2024-04-03T12:45:35.5574761Z ##[debug]....Evaluating matrix: 2024-04-03T12:45:35.5575272Z ##[debug]....=> Object 2024-04-03T12:45:35.5575841Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5576618Z ##[debug]....=> 'shard' 2024-04-03T12:45:35.5577257Z ##[debug]..=> 1 2024-04-03T12:45:35.5577852Z ##[debug]..Evaluating Index: 2024-04-03T12:45:35.5578454Z ##[debug]....Evaluating matrix: 2024-04-03T12:45:35.5579012Z ##[debug]....=> Object 2024-04-03T12:45:35.5579576Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5580266Z ##[debug]....=> 'num_shards' 2024-04-03T12:45:35.5580853Z ##[debug]..=> 1 2024-04-03T12:45:35.5581313Z ##[debug]..Evaluating Index: 2024-04-03T12:45:35.5581874Z ##[debug]....Evaluating matrix: 2024-04-03T12:45:35.5582429Z ##[debug]....=> Object 2024-04-03T12:45:35.5583125Z ##[debug]....Evaluating String: 
2024-04-03T12:45:35.5583741Z ##[debug]....=> 'runner' 2024-04-03T12:45:35.5584376Z ##[debug]..=> 'linux.rocm.gpu' 2024-04-03T12:45:35.5584991Z ##[debug]..Evaluating Index: 2024-04-03T12:45:35.5585584Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5586260Z ##[debug]......Evaluating Index: 2024-04-03T12:45:35.5586950Z ##[debug]........Evaluating steps: 2024-04-03T12:45:35.5587622Z ##[debug]........=> Object 2024-04-03T12:45:35.5588218Z ##[debug]........Evaluating String: 2024-04-03T12:45:35.5588916Z ##[debug]........=> 'get-job-id' 2024-04-03T12:45:35.5589600Z ##[debug]......=> Object 2024-04-03T12:45:35.5590258Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5590875Z ##[debug]......=> 'outputs' 2024-04-03T12:45:35.5591506Z ##[debug]....=> Object 2024-04-03T12:45:35.5592127Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5592779Z ##[debug]....=> 'job-id' 2024-04-03T12:45:35.5593642Z ##[debug]..=> '23389745826' 2024-04-03T12:45:35.5594384Z ##[debug]=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5595320Z ##[debug]Result: 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5597344Z ##[debug]Loading env 2024-04-03T12:45:35.5605551Z ##[group]Run ./.github/actions/upload-test-artifacts 2024-04-03T12:45:35.5606307Z with: 2024-04-03T12:45:35.5606764Z use-gha: true 2024-04-03T12:45:35.5607596Z file-suffix: test-default-1-1-linux.rocm.gpu_23389745826 2024-04-03T12:45:35.5608417Z s3-bucket: gha-artifacts 2024-04-03T12:45:35.5608965Z env: 2024-04-03T12:45:35.5609435Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:35.5610081Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:35.5611253Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:45:35.5612308Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:35.5612905Z AWS_REGION: us-east-1 2024-04-03T12:45:35.5613585Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:45:35.5614427Z AWS_SECRET_ACCESS_KEY: *** 
2024-04-03T12:45:35.5622712Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:35.5623701Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:35.5624616Z ##[endgroup] 2024-04-03T12:45:35.5637479Z ##[debug]Evaluating: inputs.file-suffix 2024-04-03T12:45:35.5638109Z ##[debug]Evaluating Index: 2024-04-03T12:45:35.5638625Z ##[debug]..Evaluating inputs: 2024-04-03T12:45:35.5639126Z ##[debug]..=> Object 2024-04-03T12:45:35.5639651Z ##[debug]..Evaluating String: 2024-04-03T12:45:35.5640267Z ##[debug]..=> 'file-suffix' 2024-04-03T12:45:35.5640875Z ##[debug]=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5641844Z ##[debug]Result: 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5642655Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5644757Z ##[debug]Evaluating: (success() && (runner.os != 'Windows') && !inputs.use-gha) 2024-04-03T12:45:35.5645861Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5646725Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5647569Z ##[debug]..=> true 2024-04-03T12:45:35.5648445Z ##[debug]..Evaluating NotEqual: 2024-04-03T12:45:35.5649318Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5650250Z ##[debug]......Evaluating runner: 2024-04-03T12:45:35.5651172Z ##[debug]......=> Object 2024-04-03T12:45:35.5652013Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5652799Z ##[debug]......=> 'os' 2024-04-03T12:45:35.5653740Z ##[debug]....=> 'Linux' 2024-04-03T12:45:35.5654636Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5655432Z ##[debug]....=> 'Windows' 2024-04-03T12:45:35.5656198Z ##[debug]..=> true 2024-04-03T12:45:35.5656928Z ##[debug]..Evaluating Not: 2024-04-03T12:45:35.5657733Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5658499Z ##[debug]......Evaluating inputs: 2024-04-03T12:45:35.5659485Z ##[debug]......=> Object 2024-04-03T12:45:35.5660346Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5661198Z ##[debug]......=> 
'use-gha' 2024-04-03T12:45:35.5662248Z ##[debug]....=> 'true' 2024-04-03T12:45:35.5663014Z ##[debug]..=> false 2024-04-03T12:45:35.5663956Z ##[debug]=> false 2024-04-03T12:45:35.5665434Z ##[debug]Expanded: (true && ('Linux' != 'Windows') && !'true') 2024-04-03T12:45:35.5666733Z ##[debug]Result: false 2024-04-03T12:45:35.5669689Z ##[debug]Evaluating: inputs.file-suffix 2024-04-03T12:45:35.5670642Z ##[debug]Evaluating Index: 2024-04-03T12:45:35.5671303Z ##[debug]..Evaluating inputs: 2024-04-03T12:45:35.5671892Z ##[debug]..=> Object 2024-04-03T12:45:35.5672354Z ##[debug]..Evaluating String: 2024-04-03T12:45:35.5672864Z ##[debug]..=> 'file-suffix' 2024-04-03T12:45:35.5673580Z ##[debug]=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5674471Z ##[debug]Result: 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5675262Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5677288Z ##[debug]Evaluating: (success() && (runner.os != 'Windows') && !inputs.use-gha) 2024-04-03T12:45:35.5678377Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5679160Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5679972Z ##[debug]..=> true 2024-04-03T12:45:35.5680572Z ##[debug]..Evaluating NotEqual: 2024-04-03T12:45:35.5681416Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5682314Z ##[debug]......Evaluating runner: 2024-04-03T12:45:35.5683240Z ##[debug]......=> Object 2024-04-03T12:45:35.5684245Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5684967Z ##[debug]......=> 'os' 2024-04-03T12:45:35.5685970Z ##[debug]....=> 'Linux' 2024-04-03T12:45:35.5686762Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5687662Z ##[debug]....=> 'Windows' 2024-04-03T12:45:35.5688500Z ##[debug]..=> true 2024-04-03T12:45:35.5689332Z ##[debug]..Evaluating Not: 2024-04-03T12:45:35.5690156Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5691059Z ##[debug]......Evaluating inputs: 2024-04-03T12:45:35.5691904Z ##[debug]......=> Object 
2024-04-03T12:45:35.5692796Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5693605Z ##[debug]......=> 'use-gha' 2024-04-03T12:45:35.5694472Z ##[debug]....=> 'true' 2024-04-03T12:45:35.5695279Z ##[debug]..=> false 2024-04-03T12:45:35.5695965Z ##[debug]=> false 2024-04-03T12:45:35.5697154Z ##[debug]Expanded: (true && ('Linux' != 'Windows') && !'true') 2024-04-03T12:45:35.5698276Z ##[debug]Result: false 2024-04-03T12:45:35.5700530Z ##[debug]Evaluating: inputs.file-suffix 2024-04-03T12:45:35.5701246Z ##[debug]Evaluating Index: 2024-04-03T12:45:35.5701869Z ##[debug]..Evaluating inputs: 2024-04-03T12:45:35.5702529Z ##[debug]..=> Object 2024-04-03T12:45:35.5703076Z ##[debug]..Evaluating String: 2024-04-03T12:45:35.5703583Z ##[debug]..=> 'file-suffix' 2024-04-03T12:45:35.5704298Z ##[debug]=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5705088Z ##[debug]Result: 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5705874Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5707643Z ##[debug]Evaluating: (success() && (runner.os != 'Windows') && !inputs.use-gha) 2024-04-03T12:45:35.5708562Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5709294Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5709943Z ##[debug]..=> true 2024-04-03T12:45:35.5710645Z ##[debug]..Evaluating NotEqual: 2024-04-03T12:45:35.5711370Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5712104Z ##[debug]......Evaluating runner: 2024-04-03T12:45:35.5712908Z ##[debug]......=> Object 2024-04-03T12:45:35.5713547Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5714213Z ##[debug]......=> 'os' 2024-04-03T12:45:35.5714828Z ##[debug]....=> 'Linux' 2024-04-03T12:45:35.5715468Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5716126Z ##[debug]....=> 'Windows' 2024-04-03T12:45:35.5716743Z ##[debug]..=> true 2024-04-03T12:45:35.5717318Z ##[debug]..Evaluating Not: 2024-04-03T12:45:35.5717967Z ##[debug]....Evaluating Index: 
2024-04-03T12:45:35.5718624Z ##[debug]......Evaluating inputs: 2024-04-03T12:45:35.5719299Z ##[debug]......=> Object 2024-04-03T12:45:35.5719955Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5720645Z ##[debug]......=> 'use-gha' 2024-04-03T12:45:35.5721513Z ##[debug]....=> 'true' 2024-04-03T12:45:35.5722269Z ##[debug]..=> false 2024-04-03T12:45:35.5723063Z ##[debug]=> false 2024-04-03T12:45:35.5724256Z ##[debug]Expanded: (true && ('Linux' != 'Windows') && !'true') 2024-04-03T12:45:35.5725369Z ##[debug]Result: false 2024-04-03T12:45:35.5727484Z ##[debug]Evaluating: inputs.file-suffix 2024-04-03T12:45:35.5728051Z ##[debug]Evaluating Index: 2024-04-03T12:45:35.5728536Z ##[debug]..Evaluating inputs: 2024-04-03T12:45:35.5729031Z ##[debug]..=> Object 2024-04-03T12:45:35.5729499Z ##[debug]..Evaluating String: 2024-04-03T12:45:35.5730076Z ##[debug]..=> 'file-suffix' 2024-04-03T12:45:35.5730794Z ##[debug]=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5731706Z ##[debug]Result: 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5732544Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5734641Z ##[debug]Evaluating: (success() && (runner.os == 'Windows') && !inputs.use-gha) 2024-04-03T12:45:35.5735827Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5736629Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5737548Z ##[debug]..=> true 2024-04-03T12:45:35.5738380Z ##[debug]..Evaluating Equal: 2024-04-03T12:45:35.5739278Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5740368Z ##[debug]......Evaluating runner: 2024-04-03T12:45:35.5741339Z ##[debug]......=> Object 2024-04-03T12:45:35.5742270Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5743127Z ##[debug]......=> 'os' 2024-04-03T12:45:35.5743882Z ##[debug]....=> 'Linux' 2024-04-03T12:45:35.5744680Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5745585Z ##[debug]....=> 'Windows' 2024-04-03T12:45:35.5746362Z ##[debug]..=> false 
2024-04-03T12:45:35.5746939Z ##[debug]=> false 2024-04-03T12:45:35.5748018Z ##[debug]Expanded: (true && ('Linux' == 'Windows') && !inputs['use-gha']) 2024-04-03T12:45:35.5748939Z ##[debug]Result: false 2024-04-03T12:45:35.5750699Z ##[debug]Evaluating: inputs.file-suffix 2024-04-03T12:45:35.5751268Z ##[debug]Evaluating Index: 2024-04-03T12:45:35.5751865Z ##[debug]..Evaluating inputs: 2024-04-03T12:45:35.5752366Z ##[debug]..=> Object 2024-04-03T12:45:35.5752818Z ##[debug]..Evaluating String: 2024-04-03T12:45:35.5753313Z ##[debug]..=> 'file-suffix' 2024-04-03T12:45:35.5753909Z ##[debug]=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5754709Z ##[debug]Result: 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5755510Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5757230Z ##[debug]Evaluating: (success() && (runner.os == 'Windows') && !inputs.use-gha) 2024-04-03T12:45:35.5758233Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5759108Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5759945Z ##[debug]..=> true 2024-04-03T12:45:35.5760785Z ##[debug]..Evaluating Equal: 2024-04-03T12:45:35.5761487Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5762394Z ##[debug]......Evaluating runner: 2024-04-03T12:45:35.5763235Z ##[debug]......=> Object 2024-04-03T12:45:35.5764110Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5764947Z ##[debug]......=> 'os' 2024-04-03T12:45:35.5765667Z ##[debug]....=> 'Linux' 2024-04-03T12:45:35.5766379Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5767185Z ##[debug]....=> 'Windows' 2024-04-03T12:45:35.5767805Z ##[debug]..=> false 2024-04-03T12:45:35.5768366Z ##[debug]=> false 2024-04-03T12:45:35.5769482Z ##[debug]Expanded: (true && ('Linux' == 'Windows') && !inputs['use-gha']) 2024-04-03T12:45:35.5770464Z ##[debug]Result: false 2024-04-03T12:45:35.5772362Z ##[debug]Evaluating: inputs.file-suffix 2024-04-03T12:45:35.5773131Z ##[debug]Evaluating Index: 
2024-04-03T12:45:35.5773711Z ##[debug]..Evaluating inputs: 2024-04-03T12:45:35.5774255Z ##[debug]..=> Object 2024-04-03T12:45:35.5774767Z ##[debug]..Evaluating String: 2024-04-03T12:45:35.5775265Z ##[debug]..=> 'file-suffix' 2024-04-03T12:45:35.5775991Z ##[debug]=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5776786Z ##[debug]Result: 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5777575Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5779372Z ##[debug]Evaluating: (success() && (runner.os == 'Windows') && !inputs.use-gha) 2024-04-03T12:45:35.5780441Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5781279Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5781999Z ##[debug]..=> true 2024-04-03T12:45:35.5782609Z ##[debug]..Evaluating Equal: 2024-04-03T12:45:35.5783312Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5784047Z ##[debug]......Evaluating runner: 2024-04-03T12:45:35.5784795Z ##[debug]......=> Object 2024-04-03T12:45:35.5785494Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5786163Z ##[debug]......=> 'os' 2024-04-03T12:45:35.5786754Z ##[debug]....=> 'Linux' 2024-04-03T12:45:35.5787436Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5788266Z ##[debug]....=> 'Windows' 2024-04-03T12:45:35.5789026Z ##[debug]..=> false 2024-04-03T12:45:35.5789714Z ##[debug]=> false 2024-04-03T12:45:35.5791075Z ##[debug]Expanded: (true && ('Linux' == 'Windows') && !inputs['use-gha']) 2024-04-03T12:45:35.5792129Z ##[debug]Result: false 2024-04-03T12:45:35.5793798Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5795257Z ##[debug]Evaluating: (success() && !inputs.use-gha) 2024-04-03T12:45:35.5796237Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5796980Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5797832Z ##[debug]..=> true 2024-04-03T12:45:35.5798419Z ##[debug]..Evaluating Not: 2024-04-03T12:45:35.5799217Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5800069Z 
##[debug]......Evaluating inputs: 2024-04-03T12:45:35.5800836Z ##[debug]......=> Object 2024-04-03T12:45:35.5801622Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5802550Z ##[debug]......=> 'use-gha' 2024-04-03T12:45:35.5803375Z ##[debug]....=> 'true' 2024-04-03T12:45:35.5804133Z ##[debug]..=> false 2024-04-03T12:45:35.5804912Z ##[debug]=> false 2024-04-03T12:45:35.5805767Z ##[debug]Expanded: (true && !'true') 2024-04-03T12:45:35.5806623Z ##[debug]Result: false 2024-04-03T12:45:35.5808354Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5809857Z ##[debug]Evaluating: (success() && !inputs.use-gha) 2024-04-03T12:45:35.5810798Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5811566Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5812489Z ##[debug]..=> true 2024-04-03T12:45:35.5813167Z ##[debug]..Evaluating Not: 2024-04-03T12:45:35.5814049Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5814899Z ##[debug]......Evaluating inputs: 2024-04-03T12:45:35.5815694Z ##[debug]......=> Object 2024-04-03T12:45:35.5816454Z ##[debug]......Evaluating String: 2024-04-03T12:45:35.5817248Z ##[debug]......=> 'use-gha' 2024-04-03T12:45:35.5817988Z ##[debug]....=> 'true' 2024-04-03T12:45:35.5818704Z ##[debug]..=> false 2024-04-03T12:45:35.5819455Z ##[debug]=> false 2024-04-03T12:45:35.5821025Z ##[debug]Expanded: (true && !'true') 2024-04-03T12:45:35.5822223Z ##[debug]Result: false 2024-04-03T12:45:35.5823993Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5825637Z ##[debug]Evaluating: (success() && !inputs.use-gha) 2024-04-03T12:45:35.5826608Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5827437Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5828342Z ##[debug]..=> true 2024-04-03T12:45:35.5829162Z ##[debug]..Evaluating Not: 2024-04-03T12:45:35.5829948Z ##[debug]....Evaluating Index: 2024-04-03T12:45:35.5830844Z ##[debug]......Evaluating inputs: 2024-04-03T12:45:35.5831678Z ##[debug]......=> Object 2024-04-03T12:45:35.5832512Z 
##[debug]......Evaluating String: 2024-04-03T12:45:35.5833255Z ##[debug]......=> 'use-gha' 2024-04-03T12:45:35.5834101Z ##[debug]....=> 'true' 2024-04-03T12:45:35.5834803Z ##[debug]..=> false 2024-04-03T12:45:35.5835561Z ##[debug]=> false 2024-04-03T12:45:35.5836443Z ##[debug]Expanded: (true && !'true') 2024-04-03T12:45:35.5837363Z ##[debug]Result: false 2024-04-03T12:45:35.5839240Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:35.5840909Z ##[debug]Evaluating: (success() && inputs.use-gha) 2024-04-03T12:45:35.5841867Z ##[debug]Evaluating And: 2024-04-03T12:45:35.5842810Z ##[debug]..Evaluating success: 2024-04-03T12:45:35.5843645Z ##[debug]..=> true 2024-04-03T12:45:35.5844473Z ##[debug]..Evaluating Index: 2024-04-03T12:45:35.5845314Z ##[debug]....Evaluating inputs: 2024-04-03T12:45:35.5846205Z ##[debug]....=> Object 2024-04-03T12:45:35.5847003Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5847896Z ##[debug]....=> 'use-gha' 2024-04-03T12:45:35.5848816Z ##[debug]..=> 'true' 2024-04-03T12:45:35.5849567Z ##[debug]=> 'true' 2024-04-03T12:45:35.5850530Z ##[debug]Expanded: (true && 'true') 2024-04-03T12:45:35.5851436Z ##[debug]Result: 'true' 2024-04-03T12:45:35.5852439Z ##[debug]Starting: run 2024-04-03T12:45:35.5883654Z ##[debug]Loading inputs 2024-04-03T12:45:35.5885665Z ##[debug]Evaluating: format('test-jsons-runattempt{0}-{1}.zip', github.run_attempt, inputs.file-suffix) 2024-04-03T12:45:35.5886935Z ##[debug]Evaluating format: 2024-04-03T12:45:35.5887677Z ##[debug]..Evaluating String: 2024-04-03T12:45:35.5888382Z ##[debug]..=> 'test-jsons-runattempt{0}-{1}.zip' 2024-04-03T12:45:35.5889205Z ##[debug]..Evaluating Index: 2024-04-03T12:45:35.5889900Z ##[debug]....Evaluating github: 2024-04-03T12:45:35.5890554Z ##[debug]....=> Object 2024-04-03T12:45:35.5891161Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5891935Z ##[debug]....=> 'run_attempt' 2024-04-03T12:45:35.5892599Z ##[debug]..=> '1' 2024-04-03T12:45:35.5893238Z ##[debug]..Evaluating 
Index: 2024-04-03T12:45:35.5893874Z ##[debug]....Evaluating inputs: 2024-04-03T12:45:35.5894516Z ##[debug]....=> Object 2024-04-03T12:45:35.5895113Z ##[debug]....Evaluating String: 2024-04-03T12:45:35.5895762Z ##[debug]....=> 'file-suffix' 2024-04-03T12:45:35.5896587Z ##[debug]..=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:35.5897700Z ##[debug]=> 'test-jsons-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip' 2024-04-03T12:45:35.5899040Z ##[debug]Result: 'test-jsons-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip' 2024-04-03T12:45:35.5900567Z ##[debug]Loading env 2024-04-03T12:45:35.5909213Z ##[group]Run actions/upload-artifact@v3 2024-04-03T12:45:35.5909881Z with: 2024-04-03T12:45:35.5910666Z name: test-jsons-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip 2024-04-03T12:45:35.5911603Z retention-days: 14 2024-04-03T12:45:35.5912203Z if-no-files-found: warn 2024-04-03T12:45:35.5912792Z path: test/**/*.json 2024-04-03T12:45:35.5913411Z env: 2024-04-03T12:45:35.5913923Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:35.5914590Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:35.5915757Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:45:35.5916807Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:35.5917445Z AWS_REGION: us-east-1 2024-04-03T12:45:35.5918118Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:45:35.5918905Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:45:35.5927021Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:35.5927870Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:35.5928785Z ##[endgroup] 2024-04-03T12:45:35.6684866Z ##[debug]followSymbolicLinks 'true' 2024-04-03T12:45:35.6687270Z ##[debug]implicitDescendants 'true' 2024-04-03T12:45:35.6689679Z ##[debug]omitBrokenSymbolicLinks 'true' 2024-04-03T12:45:35.6719659Z ##[debug]followSymbolicLinks 'true' 2024-04-03T12:45:35.6722113Z 
##[debug]implicitDescendants 'true' 2024-04-03T12:45:35.6724453Z ##[debug]matchDirectories 'true' 2024-04-03T12:45:35.6726747Z ##[debug]omitBrokenSymbolicLinks 'true' 2024-04-03T12:45:35.6729853Z ##[debug]Search path '/home/pytorchci/actions-runner/_work/pytorch/pytorch/test' 2024-04-03T12:45:36.2805609Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/allowlist_for_publicAPI.json was found using the provided searchPath 2024-04-03T12:45:36.2813434Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/benchmark_utils/callgrind_artifacts.json was found using the provided searchPath 2024-04-03T12:45:36.2819797Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/minioptest_failures_dict.json was found using the provided searchPath 2024-04-03T12:45:36.2826697Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/profiler/profiler_utils_mock_events.json was found using the provided searchPath 2024-04-03T12:45:36.2830457Z With the provided path, there will be 4 files uploaded 2024-04-03T12:45:36.2834027Z ##[debug]Root artifact directory is /home/pytorchci/actions-runner/_work/pytorch/pytorch/test 2024-04-03T12:45:36.2836388Z Starting artifact upload 2024-04-03T12:45:36.2841089Z For more detailed logs during the artifact upload process, enable step-debugging: https://docs.github.com/actions/monitoring-and-troubleshooting-workflows/enabling-debug-logging#enabling-step-debug-logging 2024-04-03T12:45:36.2845892Z Artifact name is valid! 
2024-04-03T12:45:36.2851060Z ##[debug]Artifact Url: https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/pipelines/workflows/8536952077/artifacts?api-version=6.0-preview 2024-04-03T12:45:36.6281720Z ##[debug]Upload Resource URL: https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/resources/Containers/58275075 2024-04-03T12:45:36.6289118Z Container for artifact "test-jsons-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip" successfully created. Starting upload of file(s) 2024-04-03T12:45:36.6293517Z ##[debug]File Concurrency: 2, and Chunk Size: 8388608 2024-04-03T12:45:36.6326295Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/allowlist_for_publicAPI.json is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:36.6369006Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/benchmark_utils/callgrind_artifacts.json is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142716-x3dQYoveGFgv to potentially reduce the upload size 2024-04-03T12:45:36.6413364Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/allowlist_for_publicAPI.json helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:36.6565163Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/benchmark_utils/callgrind_artifacts.json is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:37.0940793Z ##[debug]deleting temporary gzip file /tmp/tmp-3142716-x3dQYoveGFgv 2024-04-03T12:45:37.0952930Z ##[debug]File: 1/4. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/benchmark_utils/callgrind_artifacts.json took 463.367 milliseconds to finish upload 2024-04-03T12:45:37.0960533Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/minioptest_failures_dict.json is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:37.0981623Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/minioptest_failures_dict.json helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:37.1185997Z ##[debug]File: 2/4. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/allowlist_for_publicAPI.json took 486.988 milliseconds to finish upload 2024-04-03T12:45:37.1193209Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/profiler/profiler_utils_mock_events.json is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:37.1222046Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/profiler/profiler_utils_mock_events.json helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:37.4789369Z ##[debug]File: 3/4. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/minioptest_failures_dict.json took 383.484 milliseconds to finish upload 2024-04-03T12:45:37.4818566Z ##[debug]File: 4/4. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/profiler/profiler_utils_mock_events.json took 363.253 milliseconds to finish upload 2024-04-03T12:45:37.4823263Z Total size of all the files uploaded is 35027 bytes 2024-04-03T12:45:37.4825813Z File upload process has finished. 
Finalizing the artifact upload 2024-04-03T12:45:37.4832480Z ##[debug]Artifact Url: https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/pipelines/workflows/8536952077/artifacts?api-version=6.0-preview 2024-04-03T12:45:37.4842143Z ##[debug]URL is https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/pipelines/workflows/8536952077/artifacts?api-version=6.0-preview&artifactName=test-jsons-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip 2024-04-03T12:45:37.7227481Z ##[debug]Artifact test-jsons-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip has been successfully uploaded, total size in bytes: 329190 2024-04-03T12:45:37.7231514Z Artifact has been finalized. All files have been successfully uploaded! 2024-04-03T12:45:37.7233104Z 2024-04-03T12:45:37.7234683Z The raw size of all the files that were specified for upload is 329190 bytes 2024-04-03T12:45:37.7238718Z The size of all the files that were uploaded is 35027 bytes. This takes into account any gzip compression used to reduce the upload size, time and storage 2024-04-03T12:45:37.7241561Z 2024-04-03T12:45:37.7244829Z Note: The size of downloaded zips can differ significantly from the reported size. For more information see: https://github.com/actions/upload-artifact#zipped-artifact-downloads 2024-04-03T12:45:37.7248095Z 2024-04-03T12:45:37.7250316Z Artifact test-jsons-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip has been successfully uploaded! 
2024-04-03T12:45:37.7352613Z ##[debug]Node Action run completed with exit code 0 2024-04-03T12:45:37.7361990Z ##[debug]Finished: run 2024-04-03T12:45:37.7370582Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:37.7374960Z ##[debug]Evaluating: (success() && inputs.use-gha) 2024-04-03T12:45:37.7377158Z ##[debug]Evaluating And: 2024-04-03T12:45:37.7378891Z ##[debug]..Evaluating success: 2024-04-03T12:45:37.7381419Z ##[debug]..=> true 2024-04-03T12:45:37.7383391Z ##[debug]..Evaluating Index: 2024-04-03T12:45:37.7385614Z ##[debug]....Evaluating inputs: 2024-04-03T12:45:37.7387712Z ##[debug]....=> Object 2024-04-03T12:45:37.7389849Z ##[debug]....Evaluating String: 2024-04-03T12:45:37.7392116Z ##[debug]....=> 'use-gha' 2024-04-03T12:45:37.7394245Z ##[debug]..=> 'true' 2024-04-03T12:45:37.7396349Z ##[debug]=> 'true' 2024-04-03T12:45:37.7398813Z ##[debug]Expanded: (true && 'true') 2024-04-03T12:45:37.7401413Z ##[debug]Result: 'true' 2024-04-03T12:45:37.7404078Z ##[debug]Starting: run 2024-04-03T12:45:37.7477047Z ##[debug]Loading inputs 2024-04-03T12:45:37.7479237Z ##[debug]Evaluating: format('test-reports-runattempt{0}-{1}.zip', github.run_attempt, inputs.file-suffix) 2024-04-03T12:45:37.7480345Z ##[debug]Evaluating format: 2024-04-03T12:45:37.7480986Z ##[debug]..Evaluating String: 2024-04-03T12:45:37.7481693Z ##[debug]..=> 'test-reports-runattempt{0}-{1}.zip' 2024-04-03T12:45:37.7482672Z ##[debug]..Evaluating Index: 2024-04-03T12:45:37.7483326Z ##[debug]....Evaluating github: 2024-04-03T12:45:37.7483977Z ##[debug]....=> Object 2024-04-03T12:45:37.7484594Z ##[debug]....Evaluating String: 2024-04-03T12:45:37.7485243Z ##[debug]....=> 'run_attempt' 2024-04-03T12:45:37.7485949Z ##[debug]..=> '1' 2024-04-03T12:45:37.7486537Z ##[debug]..Evaluating Index: 2024-04-03T12:45:37.7487179Z ##[debug]....Evaluating inputs: 2024-04-03T12:45:37.7487825Z ##[debug]....=> Object 2024-04-03T12:45:37.7488429Z ##[debug]....Evaluating String: 2024-04-03T12:45:37.7489101Z 
##[debug]....=> 'file-suffix' 2024-04-03T12:45:37.7489853Z ##[debug]..=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:37.7491002Z ##[debug]=> 'test-reports-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip' 2024-04-03T12:45:37.7492431Z ##[debug]Result: 'test-reports-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip' 2024-04-03T12:45:37.7493621Z ##[debug]Loading env 2024-04-03T12:45:37.7502311Z ##[group]Run actions/upload-artifact@v3 2024-04-03T12:45:37.7503082Z with: 2024-04-03T12:45:37.7504042Z name: test-reports-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip 2024-04-03T12:45:37.7505200Z retention-days: 14 2024-04-03T12:45:37.7505882Z if-no-files-found: ignore 2024-04-03T12:45:37.7506664Z path: test/**/*.xml test/**/*.csv 2024-04-03T12:45:37.7507500Z env: 2024-04-03T12:45:37.7508122Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:37.7508929Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:37.7510416Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:45:37.7511667Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:37.7512442Z AWS_REGION: us-east-1 2024-04-03T12:45:37.7513162Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:45:37.7513931Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:45:37.7522118Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:37.7522962Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:37.7524108Z ##[endgroup] 2024-04-03T12:45:37.8296981Z ##[debug]followSymbolicLinks 'true' 2024-04-03T12:45:37.8299350Z ##[debug]implicitDescendants 'true' 2024-04-03T12:45:37.8301939Z ##[debug]omitBrokenSymbolicLinks 'true' 2024-04-03T12:45:37.8338220Z ##[debug]followSymbolicLinks 'true' 2024-04-03T12:45:37.8340875Z ##[debug]implicitDescendants 'true' 2024-04-03T12:45:37.8343583Z ##[debug]matchDirectories 'true' 2024-04-03T12:45:37.8345907Z ##[debug]omitBrokenSymbolicLinks 'true' 2024-04-03T12:45:37.8349044Z 
##[debug]Search path '/home/pytorchci/actions-runner/_work/pytorch/pytorch/test' 2024-04-03T12:45:38.4341188Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-2ae8afd0cb1a4502.xml was found using the provided searchPath 2024-04-03T12:45:38.4350526Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-477457cf3e349bab.xml was found using the provided searchPath 2024-04-03T12:45:38.4359119Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-0fdf9ecc7550dd25.xml was found using the provided searchPath 2024-04-03T12:45:38.4366912Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-13493532c7d47654.xml was found using the provided searchPath 2024-04-03T12:45:38.4374561Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-26b86f669559b9b6.xml was found using the provided searchPath 2024-04-03T12:45:38.4382250Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-29e21f73a6aae280.xml was found using the provided searchPath 2024-04-03T12:45:38.4389839Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-2cce9897dbde5277.xml was found using the provided searchPath 2024-04-03T12:45:38.4397367Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-381b0da75a322c21.xml was found using the provided searchPath 2024-04-03T12:45:38.4404970Z 
##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-384a41d4db1ea629.xml was found using the provided searchPath 2024-04-03T12:45:38.4412466Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-61f0a9f057e0e3c0.xml was found using the provided searchPath 2024-04-03T12:45:38.4420252Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-62a2ef0d44e26f89.xml was found using the provided searchPath 2024-04-03T12:45:38.4427843Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-7537d978005ed6fc.xml was found using the provided searchPath 2024-04-03T12:45:38.4430739Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-82e4b953989ac41a.xml was found using the provided searchPath 2024-04-03T12:45:38.4433515Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-84cfaae63398e9d4.xml was found using the provided searchPath 2024-04-03T12:45:38.4436440Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-9ff7dc171fc9cee6.xml was found using the provided searchPath 2024-04-03T12:45:38.4439354Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-a65687febe18fc21.xml was found using the provided searchPath 2024-04-03T12:45:38.4442174Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-c458be4a8031c92d.xml was found using the provided searchPath 2024-04-03T12:45:38.4444958Z 
##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-ca27edcfa6e392df.xml was found using the provided searchPath 2024-04-03T12:45:38.4447713Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-e69a01fa7d9a5412.xml was found using the provided searchPath 2024-04-03T12:45:38.4450595Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-f2c3ef0b14fa72b5.xml was found using the provided searchPath 2024-04-03T12:45:38.4453382Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-f7e01420c8f418e6.xml was found using the provided searchPath 2024-04-03T12:45:38.4456156Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_autograd/test_autograd-20ad5bb844e9e8ff.xml was found using the provided searchPath 2024-04-03T12:45:38.4458961Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_binary_ufuncs/test_binary_ufuncs-c842e8510247d0b0.xml was found using the provided searchPath 2024-04-03T12:45:38.4461796Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_cuda/test_cuda-fded1be63da4e352.xml was found using the provided searchPath 2024-04-03T12:45:38.4464455Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_nn/test_nn-7c4a94df2831e8d9.xml was found using the provided searchPath 2024-04-03T12:45:38.4467100Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-07882f1a134c6e45.xml was found using the provided searchPath 2024-04-03T12:45:38.4469732Z 
##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-1eb641fd7eeb4b15.xml was found using the provided searchPath 2024-04-03T12:45:38.4472375Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-31c34b76f1964d8e.xml was found using the provided searchPath 2024-04-03T12:45:38.4475024Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-3aff72fcc3c543e7.xml was found using the provided searchPath 2024-04-03T12:45:38.4477654Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-4d8085e32174cf0c.xml was found using the provided searchPath 2024-04-03T12:45:38.4480436Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-6ab071461d9c491b.xml was found using the provided searchPath 2024-04-03T12:45:38.4483622Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-8932978f61174e80.xml was found using the provided searchPath 2024-04-03T12:45:38.4486923Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-997f3766f9e26d1d.xml was found using the provided searchPath 2024-04-03T12:45:38.4490282Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-a0dfcf299b089fe7.xml was found using the provided searchPath 2024-04-03T12:45:38.4493491Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-a76b79796a5f3fa6.xml was found using the provided searchPath 2024-04-03T12:45:38.4496852Z 
##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_torch/test_torch-68f33d1fb3a77a7a.xml was found using the provided searchPath 2024-04-03T12:45:38.4499924Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_unary_ufuncs/test_unary_ufuncs-41928f9edbc768d1.xml was found using the provided searchPath 2024-04-03T12:45:38.4501553Z With the provided path, there will be 37 files uploaded 2024-04-03T12:45:38.4502856Z ##[debug]Root artifact directory is /home/pytorchci/actions-runner/_work/pytorch/pytorch/test 2024-04-03T12:45:38.4503740Z Starting artifact upload 2024-04-03T12:45:38.4505584Z For more detailed logs during the artifact upload process, enable step-debugging: https://docs.github.com/actions/monitoring-and-troubleshooting-workflows/enabling-debug-logging#enabling-step-debug-logging 2024-04-03T12:45:38.4507202Z Artifact name is valid! 2024-04-03T12:45:38.4509113Z ##[debug]Artifact Url: https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/pipelines/workflows/8536952077/artifacts?api-version=6.0-preview 2024-04-03T12:45:38.7787349Z ##[debug]Upload Resource URL: https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/resources/Containers/58275075 2024-04-03T12:45:38.7793854Z Container for artifact "test-reports-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip" successfully created. Starting upload of file(s) 2024-04-03T12:45:38.7798205Z ##[debug]File Concurrency: 2, and Chunk Size: 8388608 2024-04-03T12:45:38.7872445Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-477457cf3e349bab.xml is greater than 64k in size. 
Creating a gzip file on-disk /tmp/tmp-3142727-wlKnQvxZoDLg to potentially reduce the upload size 2024-04-03T12:45:38.7886608Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-2ae8afd0cb1a4502.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-XzD0MzUc7wgh to potentially reduce the upload size 2024-04-03T12:45:38.7939526Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-477457cf3e349bab.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:38.7950924Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-2ae8afd0cb1a4502.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:39.0966364Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-XzD0MzUc7wgh 2024-04-03T12:45:39.0977902Z ##[debug]File: 1/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-2ae8afd0cb1a4502.xml took 313.289 milliseconds to finish upload 2024-04-03T12:45:39.0987271Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-0fdf9ecc7550dd25.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:39.1016102Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-0fdf9ecc7550dd25.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 
2024-04-03T12:45:39.1622036Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-wlKnQvxZoDLg 2024-04-03T12:45:39.1628656Z ##[debug]File: 2/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/inductor.test_torchinductor/inductor.test_torchinductor-477457cf3e349bab.xml took 378.013 milliseconds to finish upload 2024-04-03T12:45:39.1637780Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-13493532c7d47654.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:39.1647363Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-13493532c7d47654.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:39.4646357Z ##[debug]File: 3/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-0fdf9ecc7550dd25.xml took 366.676 milliseconds to finish upload 2024-04-03T12:45:39.4655390Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-26b86f669559b9b6.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:39.4670885Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-26b86f669559b9b6.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:39.5166507Z ##[debug]File: 4/37. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-13493532c7d47654.xml took 353.950 milliseconds to finish upload 2024-04-03T12:45:39.5172727Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-29e21f73a6aae280.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:39.5185822Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-29e21f73a6aae280.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:39.8195762Z ##[debug]File: 5/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-26b86f669559b9b6.xml took 354.859 milliseconds to finish upload 2024-04-03T12:45:39.8204218Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-2cce9897dbde5277.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:39.8216451Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-2cce9897dbde5277.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:39.8810147Z ##[debug]File: 6/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-29e21f73a6aae280.xml took 364.142 milliseconds to finish upload 2024-04-03T12:45:39.8818974Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-381b0da75a322c21.xml is less than 64k in size. 
Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:39.8829044Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-381b0da75a322c21.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:40.1815568Z ##[debug]File: 7/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-2cce9897dbde5277.xml took 361.906 milliseconds to finish upload 2024-04-03T12:45:40.1824656Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-384a41d4db1ea629.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:40.1836708Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-384a41d4db1ea629.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:40.2378436Z ##[debug]File: 8/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-381b0da75a322c21.xml took 356.774 milliseconds to finish upload 2024-04-03T12:45:40.2387127Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-61f0a9f057e0e3c0.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:40.2415699Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-61f0a9f057e0e3c0.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:40.4622632Z ##[debug]File: 9/37. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-384a41d4db1ea629.xml took 280.625 milliseconds to finish upload 2024-04-03T12:45:40.4631144Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-62a2ef0d44e26f89.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:40.4643228Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-62a2ef0d44e26f89.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:40.6062290Z ##[debug]File: 10/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-61f0a9f057e0e3c0.xml took 368.281 milliseconds to finish upload 2024-04-03T12:45:40.6070824Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-7537d978005ed6fc.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:40.6080281Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-7537d978005ed6fc.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:40.8447909Z ##[debug]File: 11/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-62a2ef0d44e26f89.xml took 382.445 milliseconds to finish upload 2024-04-03T12:45:40.8456440Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-82e4b953989ac41a.xml is less than 64k in size. 
Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:40.8466078Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-82e4b953989ac41a.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:40.9582031Z ##[debug]File: 12/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-7537d978005ed6fc.xml took 351.867 milliseconds to finish upload 2024-04-03T12:45:40.9590979Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-84cfaae63398e9d4.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:40.9600691Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-84cfaae63398e9d4.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:41.1990139Z ##[debug]File: 13/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-82e4b953989ac41a.xml took 354.099 milliseconds to finish upload 2024-04-03T12:45:41.1999221Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-9ff7dc171fc9cee6.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:41.2009307Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-9ff7dc171fc9cee6.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:41.3266652Z ##[debug]File: 14/37. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-84cfaae63398e9d4.xml took 368.421 milliseconds to finish upload 2024-04-03T12:45:41.3275109Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-a65687febe18fc21.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:41.3284556Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-a65687febe18fc21.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:41.5495231Z ##[debug]File: 15/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-9ff7dc171fc9cee6.xml took 350.470 milliseconds to finish upload 2024-04-03T12:45:41.5503999Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-c458be4a8031c92d.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:41.5513474Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-c458be4a8031c92d.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:41.6851255Z ##[debug]File: 16/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-a65687febe18fc21.xml took 358.399 milliseconds to finish upload 2024-04-03T12:45:41.6859899Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-ca27edcfa6e392df.xml is less than 64k in size. 
Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:41.6869380Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-ca27edcfa6e392df.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:41.9030534Z ##[debug]File: 17/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-c458be4a8031c92d.xml took 353.456 milliseconds to finish upload 2024-04-03T12:45:41.9039138Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-e69a01fa7d9a5412.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:41.9049263Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-e69a01fa7d9a5412.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:42.0245771Z ##[debug]File: 18/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-ca27edcfa6e392df.xml took 339.387 milliseconds to finish upload 2024-04-03T12:45:42.0254319Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-f2c3ef0b14fa72b5.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:42.0264063Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-f2c3ef0b14fa72b5.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:42.2683162Z ##[debug]File: 19/37. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-e69a01fa7d9a5412.xml took 365.205 milliseconds to finish upload 2024-04-03T12:45:42.2691663Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-f7e01420c8f418e6.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:42.2701323Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-f7e01420c8f418e6.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:42.3651063Z ##[debug]File: 20/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-f2c3ef0b14fa72b5.xml took 340.476 milliseconds to finish upload 2024-04-03T12:45:42.3660642Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_autograd/test_autograd-20ad5bb844e9e8ff.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-jetmYLKrp4zB to potentially reduce the upload size 2024-04-03T12:45:42.3714706Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_autograd/test_autograd-20ad5bb844e9e8ff.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:42.6203058Z ##[debug]File: 21/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test.run_test/test.run_test-f7e01420c8f418e6.xml took 351.936 milliseconds to finish upload 2024-04-03T12:45:42.6212230Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_binary_ufuncs/test_binary_ufuncs-c842e8510247d0b0.xml is greater than 64k in size. 
Creating a gzip file on-disk /tmp/tmp-3142727-3NGWG4zSOHIn to potentially reduce the upload size 2024-04-03T12:45:42.6616607Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_binary_ufuncs/test_binary_ufuncs-c842e8510247d0b0.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:42.8324448Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-jetmYLKrp4zB 2024-04-03T12:45:42.8330109Z ##[debug]File: 22/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_autograd/test_autograd-20ad5bb844e9e8ff.xml took 467.797 milliseconds to finish upload 2024-04-03T12:45:42.8338432Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_cuda/test_cuda-fded1be63da4e352.xml is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:42.8349911Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_cuda/test_cuda-fded1be63da4e352.xml helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:43.1445040Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-3NGWG4zSOHIn 2024-04-03T12:45:43.1450800Z ##[debug]File: 23/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_binary_ufuncs/test_binary_ufuncs-c842e8510247d0b0.xml took 524.730 milliseconds to finish upload 2024-04-03T12:45:43.1459942Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_nn/test_nn-7c4a94df2831e8d9.xml is greater than 64k in size. 
Creating a gzip file on-disk /tmp/tmp-3142727-fQb7RY2Y5qYu to potentially reduce the upload size 2024-04-03T12:45:43.1604442Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_nn/test_nn-7c4a94df2831e8d9.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:43.1931408Z ##[debug]File: 24/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_cuda/test_cuda-fded1be63da4e352.xml took 360.188 milliseconds to finish upload 2024-04-03T12:45:43.1940376Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-07882f1a134c6e45.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-XCvgIVq4bTv5 to potentially reduce the upload size 2024-04-03T12:45:43.2102769Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-07882f1a134c6e45.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:43.6263397Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-fQb7RY2Y5qYu 2024-04-03T12:45:43.6268851Z ##[debug]File: 25/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_nn/test_nn-7c4a94df2831e8d9.xml took 481.835 milliseconds to finish upload 2024-04-03T12:45:43.6277818Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-1eb641fd7eeb4b15.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-0uqv7kcaJC0R to potentially reduce the upload size 2024-04-03T12:45:43.6451691Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-1eb641fd7eeb4b15.xml is smaller than the original file. The file will be uploaded using gzip. 
2024-04-03T12:45:43.6804899Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-XCvgIVq4bTv5 2024-04-03T12:45:43.6810406Z ##[debug]File: 26/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-07882f1a134c6e45.xml took 487.883 milliseconds to finish upload 2024-04-03T12:45:43.6819109Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-31c34b76f1964d8e.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-PRi7r9HlB9BB to potentially reduce the upload size 2024-04-03T12:45:43.6966044Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-31c34b76f1964d8e.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:44.0694355Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-0uqv7kcaJC0R 2024-04-03T12:45:44.0702033Z ##[debug]File: 27/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-1eb641fd7eeb4b15.xml took 443.256 milliseconds to finish upload 2024-04-03T12:45:44.0714526Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-3aff72fcc3c543e7.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-PQYpWiL0DUEe to potentially reduce the upload size 2024-04-03T12:45:44.0881098Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-3aff72fcc3c543e7.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:44.1234650Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-PRi7r9HlB9BB 2024-04-03T12:45:44.1240115Z ##[debug]File: 28/37. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-31c34b76f1964d8e.xml took 442.854 milliseconds to finish upload 2024-04-03T12:45:44.1248762Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-4d8085e32174cf0c.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-7C1ZAHPrmgSw to potentially reduce the upload size 2024-04-03T12:45:44.1407855Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-4d8085e32174cf0c.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:44.4328418Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-PQYpWiL0DUEe 2024-04-03T12:45:44.4333880Z ##[debug]File: 29/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-3aff72fcc3c543e7.xml took 363.085 milliseconds to finish upload 2024-04-03T12:45:44.4345020Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-6ab071461d9c491b.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-MSnLkV7rlwKO to potentially reduce the upload size 2024-04-03T12:45:44.4525752Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-6ab071461d9c491b.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:44.5819258Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-7C1ZAHPrmgSw 2024-04-03T12:45:44.5824938Z ##[debug]File: 30/37. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-4d8085e32174cf0c.xml took 458.432 milliseconds to finish upload 2024-04-03T12:45:44.5833641Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-8932978f61174e80.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-es2oq7YJovJX to potentially reduce the upload size 2024-04-03T12:45:44.6012373Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-8932978f61174e80.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:44.9234503Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-MSnLkV7rlwKO 2024-04-03T12:45:44.9239966Z ##[debug]File: 31/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-6ab071461d9c491b.xml took 490.579 milliseconds to finish upload 2024-04-03T12:45:44.9249182Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-997f3766f9e26d1d.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-0ic0I61ffezv to potentially reduce the upload size 2024-04-03T12:45:44.9428487Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-997f3766f9e26d1d.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:45.0675039Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-es2oq7YJovJX 2024-04-03T12:45:45.0680948Z ##[debug]File: 32/37. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-8932978f61174e80.xml took 485.629 milliseconds to finish upload 2024-04-03T12:45:45.0690047Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-a0dfcf299b089fe7.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-szT26A4v6ZgP to potentially reduce the upload size 2024-04-03T12:45:45.0899935Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-a0dfcf299b089fe7.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:45.4061290Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-0ic0I61ffezv 2024-04-03T12:45:45.4066772Z ##[debug]File: 33/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-997f3766f9e26d1d.xml took 482.572 milliseconds to finish upload 2024-04-03T12:45:45.4078558Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-a76b79796a5f3fa6.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-Gds1SiYDswwJ to potentially reduce the upload size 2024-04-03T12:45:45.4241315Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-a76b79796a5f3fa6.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:45.5984637Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-szT26A4v6ZgP 2024-04-03T12:45:45.5990089Z ##[debug]File: 34/37. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-a0dfcf299b089fe7.xml took 530.948 milliseconds to finish upload 2024-04-03T12:45:45.5999572Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_torch/test_torch-68f33d1fb3a77a7a.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-eYedJJBc61QD to potentially reduce the upload size 2024-04-03T12:45:45.6059234Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_torch/test_torch-68f33d1fb3a77a7a.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:45.9170013Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-Gds1SiYDswwJ 2024-04-03T12:45:45.9175444Z ##[debug]File: 35/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_ops/test_ops-a76b79796a5f3fa6.xml took 510.822 milliseconds to finish upload 2024-04-03T12:45:45.9184780Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_unary_ufuncs/test_unary_ufuncs-41928f9edbc768d1.xml is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142727-o8iSBWRNo1zy to potentially reduce the upload size 2024-04-03T12:45:45.9883452Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_unary_ufuncs/test_unary_ufuncs-41928f9edbc768d1.xml is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:46.0271593Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-eYedJJBc61QD 2024-04-03T12:45:46.0278031Z ##[debug]File: 36/37. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_torch/test_torch-68f33d1fb3a77a7a.xml took 428.878 milliseconds to finish upload 2024-04-03T12:45:46.6158232Z ##[debug]deleting temporary gzip file /tmp/tmp-3142727-o8iSBWRNo1zy 2024-04-03T12:45:46.6165756Z ##[debug]File: 37/37. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/python-pytest/test_unary_ufuncs/test_unary_ufuncs-41928f9edbc768d1.xml took 698.799 milliseconds to finish upload 2024-04-03T12:45:46.6170674Z Total size of all the files uploaded is 610283 bytes 2024-04-03T12:45:46.6172836Z File upload process has finished. Finalizing the artifact upload 2024-04-03T12:45:46.6178774Z ##[debug]Artifact Url: https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/pipelines/workflows/8536952077/artifacts?api-version=6.0-preview 2024-04-03T12:45:46.6189647Z ##[debug]URL is https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/pipelines/workflows/8536952077/artifacts?api-version=6.0-preview&artifactName=test-reports-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip 2024-04-03T12:45:46.9188153Z ##[debug]Artifact test-reports-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip has been successfully uploaded, total size in bytes: 11931362 2024-04-03T12:45:46.9192233Z Artifact has been finalized. All files have been successfully uploaded! 2024-04-03T12:45:46.9193830Z 2024-04-03T12:45:46.9194982Z The raw size of all the files that were specified for upload is 11931362 bytes 2024-04-03T12:45:46.9199358Z The size of all the files that were uploaded is 610283 bytes. This takes into account any gzip compression used to reduce the upload size, time and storage 2024-04-03T12:45:46.9202175Z 2024-04-03T12:45:46.9205438Z Note: The size of downloaded zips can differ significantly from the reported size. 
For more information see: https://github.com/actions/upload-artifact#zipped-artifact-downloads 2024-04-03T12:45:46.9208679Z 2024-04-03T12:45:46.9210857Z Artifact test-reports-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip has been successfully uploaded! 2024-04-03T12:45:46.9327400Z ##[debug]Node Action run completed with exit code 0 2024-04-03T12:45:46.9336905Z ##[debug]Finished: run 2024-04-03T12:45:46.9347775Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:46.9352724Z ##[debug]Evaluating: (success() && inputs.use-gha) 2024-04-03T12:45:46.9355427Z ##[debug]Evaluating And: 2024-04-03T12:45:46.9357713Z ##[debug]..Evaluating success: 2024-04-03T12:45:46.9360074Z ##[debug]..=> true 2024-04-03T12:45:46.9362102Z ##[debug]..Evaluating Index: 2024-04-03T12:45:46.9364433Z ##[debug]....Evaluating inputs: 2024-04-03T12:45:46.9366579Z ##[debug]....=> Object 2024-04-03T12:45:46.9368833Z ##[debug]....Evaluating String: 2024-04-03T12:45:46.9371126Z ##[debug]....=> 'use-gha' 2024-04-03T12:45:46.9373317Z ##[debug]..=> 'true' 2024-04-03T12:45:46.9375465Z ##[debug]=> 'true' 2024-04-03T12:45:46.9377828Z ##[debug]Expanded: (true && 'true') 2024-04-03T12:45:46.9380638Z ##[debug]Result: 'true' 2024-04-03T12:45:46.9383240Z ##[debug]Starting: run 2024-04-03T12:45:46.9441074Z ##[debug]Loading inputs 2024-04-03T12:45:46.9443176Z ##[debug]Evaluating: format('logs-runattempt{0}-{1}.zip', github.run_attempt, inputs.file-suffix) 2024-04-03T12:45:46.9444225Z ##[debug]Evaluating format: 2024-04-03T12:45:46.9444849Z ##[debug]..Evaluating String: 2024-04-03T12:45:46.9445543Z ##[debug]..=> 'logs-runattempt{0}-{1}.zip' 2024-04-03T12:45:46.9446424Z ##[debug]..Evaluating Index: 2024-04-03T12:45:46.9447055Z ##[debug]....Evaluating github: 2024-04-03T12:45:46.9447719Z ##[debug]....=> Object 2024-04-03T12:45:46.9448318Z ##[debug]....Evaluating String: 2024-04-03T12:45:46.9448975Z ##[debug]....=> 'run_attempt' 2024-04-03T12:45:46.9449617Z ##[debug]..=> '1' 
2024-04-03T12:45:46.9450266Z ##[debug]..Evaluating Index: 2024-04-03T12:45:46.9450928Z ##[debug]....Evaluating inputs: 2024-04-03T12:45:46.9451564Z ##[debug]....=> Object 2024-04-03T12:45:46.9452156Z ##[debug]....Evaluating String: 2024-04-03T12:45:46.9452896Z ##[debug]....=> 'file-suffix' 2024-04-03T12:45:46.9453649Z ##[debug]..=> 'test-default-1-1-linux.rocm.gpu_23389745826' 2024-04-03T12:45:46.9454718Z ##[debug]=> 'logs-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip' 2024-04-03T12:45:46.9455937Z ##[debug]Result: 'logs-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip' 2024-04-03T12:45:46.9457114Z ##[debug]Loading env 2024-04-03T12:45:46.9465720Z ##[group]Run actions/upload-artifact@v3 2024-04-03T12:45:46.9466383Z with: 2024-04-03T12:45:46.9467129Z name: logs-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip 2024-04-03T12:45:46.9468024Z retention-days: 14 2024-04-03T12:45:46.9468607Z if-no-files-found: ignore 2024-04-03T12:45:46.9469236Z path: usage_log.txt test/**/*.log 2024-04-03T12:45:46.9469947Z env: 2024-04-03T12:45:46.9470457Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:46.9471332Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:46.9472511Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:45:46.9473562Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:46.9474204Z AWS_REGION: us-east-1 2024-04-03T12:45:46.9474887Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:45:46.9475654Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:45:46.9483816Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:46.9484657Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:46.9485653Z ##[endgroup] 2024-04-03T12:45:47.0241952Z ##[debug]followSymbolicLinks 'true' 2024-04-03T12:45:47.0244859Z ##[debug]implicitDescendants 'true' 2024-04-03T12:45:47.0247162Z ##[debug]omitBrokenSymbolicLinks 'true' 2024-04-03T12:45:47.0281449Z ##[debug]followSymbolicLinks 
'true' 2024-04-03T12:45:47.0283849Z ##[debug]implicitDescendants 'true' 2024-04-03T12:45:47.0286146Z ##[debug]matchDirectories 'true' 2024-04-03T12:45:47.0288438Z ##[debug]omitBrokenSymbolicLinks 'true' 2024-04-03T12:45:47.0291612Z ##[debug]Search path '/home/pytorchci/actions-runner/_work/pytorch/pytorch/usage_log.txt' 2024-04-03T12:45:47.0298965Z ##[debug]Search path '/home/pytorchci/actions-runner/_work/pytorch/pytorch/test' 2024-04-03T12:45:47.7348524Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/usage_log.txt was found using the provided searchPath 2024-04-03T12:45:47.7354548Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.Dict_test_1.1_eea8d3f17ca7dd6e_.log was found using the provided searchPath 2024-04-03T12:45:47.7361315Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.Dimname_test_1.1_08c97e9aaefb0635_.log was found using the provided searchPath 2024-04-03T12:45:47.7368100Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.NamedTensor_test_1.1_a8dfdbdfb46be735_.log was found using the provided searchPath 2024-04-03T12:45:47.7374969Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.apply_utils_test_1.1_68d9c458701743b3_.log was found using the provided searchPath 2024-04-03T12:45:47.7381963Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.atest_1.1_b3ed596288111141_.log was found using the provided searchPath 2024-04-03T12:45:47.7388458Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.basic_1.1_8ad405c68b3c9450_.log was found using the provided searchPath 2024-04-03T12:45:47.7395161Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.broadcast_test_1.1_a1b53734d2985b07_.log was found using the provided searchPath 2024-04-03T12:45:47.7401868Z 
##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.cpu_generator_test_1.1_740415e53527f4fe_.log was found using the provided searchPath 2024-04-03T12:45:47.7408669Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.dlconvertor_test_1.1_246a6a66396b482a_.log was found using the provided searchPath 2024-04-03T12:45:47.7415501Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.extension_backend_test_1.1_71867304633d616d_.log was found using the provided searchPath 2024-04-03T12:45:47.7422371Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.lazy_tensor_test_1.1_c65241e3a4acb408_.log was found using the provided searchPath 2024-04-03T12:45:47.7429547Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.legacy_vmap_test_1.1_09ca941fda98da41_.log was found using the provided searchPath 2024-04-03T12:45:47.7436253Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.native_test_1.1_ca23ed2d0b9a9804_.log was found using the provided searchPath 2024-04-03T12:45:47.7443208Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.operators_test_1.1_3fca26950c6290e2_.log was found using the provided searchPath 2024-04-03T12:45:47.7449992Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.scalar_tensor_test_1.1_04259dce2fa8a4a0_.log was found using the provided searchPath 2024-04-03T12:45:47.7456705Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.scalar_test_1.1_a524f8e38d588a13_.log was found using the provided searchPath 2024-04-03T12:45:47.7463594Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.tensor_iterator_test_1.1_8b242e916db0b27c_.log was found using the provided searchPath 
2024-04-03T12:45:47.7470687Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.undefined_tensor_test_1.1_a70bc1eb9d45214c_.log was found using the provided searchPath 2024-04-03T12:45:47.7477499Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.wrapdim_test_1.1_ed33bb88894fd28b_.log was found using the provided searchPath 2024-04-03T12:45:47.7484313Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/inductor.test_torchinductor_1.2_d8c374349ca31492_.log was found using the provided searchPath 2024-04-03T12:45:47.7491272Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/inductor.test_torchinductor_2.2_81c7c8f305edca70_.log was found using the provided searchPath 2024-04-03T12:45:47.7497995Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_autograd_1.1_64b93bfb7a9554cc_.log was found using the provided searchPath 2024-04-03T12:45:47.7504981Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_binary_ufuncs_1.1_f1b11807bec3c907_.log was found using the provided searchPath 2024-04-03T12:45:47.7511518Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_cuda_1.1_be5818e4c77f9677_.log was found using the provided searchPath 2024-04-03T12:45:47.7516069Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_nn_1.1_fd4c49d4ed7f525b_.log was found using the provided searchPath 2024-04-03T12:45:47.7518426Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_1.10_32c20bd2f2b0e279_.log was found using the provided searchPath 2024-04-03T12:45:47.7520812Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_10.10_21964ae873e2930d_.log was found using the provided searchPath 2024-04-03T12:45:47.7523206Z 
##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_2.10_dbac769edb38711e_.log was found using the provided searchPath 2024-04-03T12:45:47.7525582Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_3.10_c151b39a64edb912_.log was found using the provided searchPath 2024-04-03T12:45:47.7527963Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_4.10_dec7da1c824451ee_.log was found using the provided searchPath 2024-04-03T12:45:47.7530361Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_5.10_d6135ccdc22f06c9_.log was found using the provided searchPath 2024-04-03T12:45:47.7532724Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_6.10_4e3eaa86326da093_.log was found using the provided searchPath 2024-04-03T12:45:47.7535220Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_7.10_42fb8f0437415d49_.log was found using the provided searchPath 2024-04-03T12:45:47.7537606Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_8.10_abe63000578516da_.log was found using the provided searchPath 2024-04-03T12:45:47.7540269Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_9.10_9497e0f5766be958_.log was found using the provided searchPath 2024-04-03T12:45:47.7542673Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_torch_1.1_88a66d12edd14a59_.log was found using the provided searchPath 2024-04-03T12:45:47.7545120Z ##[debug]File:/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_unary_ufuncs_1.1_91fe2da0719b659f_.log was found using the provided searchPath 2024-04-03T12:45:47.7546728Z Multiple search paths detected. 
Calculating the least common ancestor of all paths 2024-04-03T12:45:47.7548312Z ##[debug]Using search path /home/pytorchci/actions-runner/_work/pytorch/pytorch/usage_log.txt 2024-04-03T12:45:47.7549774Z ##[debug]Using search path /home/pytorchci/actions-runner/_work/pytorch/pytorch/test 2024-04-03T12:45:47.7551301Z The least common ancestor is /home/pytorchci/actions-runner/_work/pytorch/pytorch. This will be the root directory of the artifact 2024-04-03T12:45:47.7552561Z With the provided path, there will be 38 files uploaded 2024-04-03T12:45:47.7553817Z ##[debug]Root artifact directory is /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T12:45:47.7554663Z Starting artifact upload 2024-04-03T12:45:47.7556390Z For more detailed logs during the artifact upload process, enable step-debugging: https://docs.github.com/actions/monitoring-and-troubleshooting-workflows/enabling-debug-logging#enabling-step-debug-logging 2024-04-03T12:45:47.7558021Z Artifact name is valid! 2024-04-03T12:45:47.7559938Z ##[debug]Artifact Url: https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/pipelines/workflows/8536952077/artifacts?api-version=6.0-preview 2024-04-03T12:45:48.0847387Z ##[debug]Upload Resource URL: https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/resources/Containers/58275075 2024-04-03T12:45:48.0853488Z Container for artifact "logs-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip" successfully created. Starting upload of file(s) 2024-04-03T12:45:48.0857682Z ##[debug]File Concurrency: 2, and Chunk Size: 8388608 2024-04-03T12:45:48.0909338Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.Dict_test_1.1_eea8d3f17ca7dd6e_.log is less than 64k in size. 
Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:48.0931676Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/usage_log.txt is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-5ewZv0fRtC9V to potentially reduce the upload size 2024-04-03T12:45:48.0959037Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.Dict_test_1.1_eea8d3f17ca7dd6e_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:48.1738173Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/usage_log.txt is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:48.5126393Z ##[debug]File: 1/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.Dict_test_1.1_eea8d3f17ca7dd6e_.log took 423.346 milliseconds to finish upload 2024-04-03T12:45:48.5134159Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.Dimname_test_1.1_08c97e9aaefb0635_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:48.5148521Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.Dimname_test_1.1_08c97e9aaefb0635_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:48.8180255Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-5ewZv0fRtC9V 2024-04-03T12:45:48.8191769Z ##[debug]File: 2/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/usage_log.txt took 730.260 milliseconds to finish upload 2024-04-03T12:45:48.8199457Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.NamedTensor_test_1.1_a8dfdbdfb46be735_.log is less than 64k in size. 
Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:48.8211338Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.NamedTensor_test_1.1_a8dfdbdfb46be735_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:48.8509436Z ##[debug]File: 3/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.Dimname_test_1.1_08c97e9aaefb0635_.log took 338.212 milliseconds to finish upload 2024-04-03T12:45:48.8518006Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.apply_utils_test_1.1_68d9c458701743b3_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:48.8527230Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.apply_utils_test_1.1_68d9c458701743b3_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:49.1218544Z ##[debug]File: 4/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.apply_utils_test_1.1_68d9c458701743b3_.log took 270.832 milliseconds to finish upload 2024-04-03T12:45:49.1226934Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.atest_1.1_b3ed596288111141_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:49.1239322Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.atest_1.1_b3ed596288111141_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:49.1586862Z ##[debug]File: 5/38. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.NamedTensor_test_1.1_a8dfdbdfb46be735_.log took 339.265 milliseconds to finish upload 2024-04-03T12:45:49.1595097Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.basic_1.1_8ad405c68b3c9450_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:49.1604160Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.basic_1.1_8ad405c68b3c9450_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:49.4802160Z ##[debug]File: 6/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.atest_1.1_b3ed596288111141_.log took 358.331 milliseconds to finish upload 2024-04-03T12:45:49.4809701Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.broadcast_test_1.1_a1b53734d2985b07_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:49.4820600Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.broadcast_test_1.1_a1b53734d2985b07_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:49.5769432Z ##[debug]File: 7/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.basic_1.1_8ad405c68b3c9450_.log took 418.087 milliseconds to finish upload 2024-04-03T12:45:49.5778286Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.cpu_generator_test_1.1_740415e53527f4fe_.log is less than 64k in size. 
Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:49.5788760Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.cpu_generator_test_1.1_740415e53527f4fe_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:49.8397303Z ##[debug]File: 8/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.broadcast_test_1.1_a1b53734d2985b07_.log took 359.362 milliseconds to finish upload 2024-04-03T12:45:49.8405700Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.dlconvertor_test_1.1_246a6a66396b482a_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:49.8418969Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.dlconvertor_test_1.1_246a6a66396b482a_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:49.9405213Z ##[debug]File: 9/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.cpu_generator_test_1.1_740415e53527f4fe_.log took 363.621 milliseconds to finish upload 2024-04-03T12:45:49.9413936Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.extension_backend_test_1.1_71867304633d616d_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:49.9423468Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.extension_backend_test_1.1_71867304633d616d_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:50.0994584Z ##[debug]File: 10/38. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.dlconvertor_test_1.1_246a6a66396b482a_.log took 259.623 milliseconds to finish upload 2024-04-03T12:45:50.1002847Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.lazy_tensor_test_1.1_c65241e3a4acb408_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:50.1012327Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.lazy_tensor_test_1.1_c65241e3a4acb408_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:50.2970313Z ##[debug]File: 11/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.extension_backend_test_1.1_71867304633d616d_.log took 356.411 milliseconds to finish upload 2024-04-03T12:45:50.2978773Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.legacy_vmap_test_1.1_09ca941fda98da41_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:50.2988354Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.legacy_vmap_test_1.1_09ca941fda98da41_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:50.4869050Z ##[debug]File: 12/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.lazy_tensor_test_1.1_c65241e3a4acb408_.log took 387.386 milliseconds to finish upload 2024-04-03T12:45:50.4877256Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.native_test_1.1_ca23ed2d0b9a9804_.log is less than 64k in size. 
Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:50.4886460Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.native_test_1.1_ca23ed2d0b9a9804_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:50.8062393Z ##[debug]File: 13/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.legacy_vmap_test_1.1_09ca941fda98da41_.log took 509.098 milliseconds to finish upload 2024-04-03T12:45:50.8071231Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.operators_test_1.1_3fca26950c6290e2_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:50.8080619Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.operators_test_1.1_3fca26950c6290e2_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:50.8439422Z ##[debug]File: 14/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.native_test_1.1_ca23ed2d0b9a9804_.log took 356.977 milliseconds to finish upload 2024-04-03T12:45:50.8447764Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.scalar_tensor_test_1.1_04259dce2fa8a4a0_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:50.8457541Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.scalar_tensor_test_1.1_04259dce2fa8a4a0_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:51.1353624Z ##[debug]File: 15/38. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.scalar_tensor_test_1.1_04259dce2fa8a4a0_.log took 291.340 milliseconds to finish upload 2024-04-03T12:45:51.1361926Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.scalar_test_1.1_a524f8e38d588a13_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:51.1371082Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.scalar_test_1.1_a524f8e38d588a13_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:51.1728587Z ##[debug]File: 16/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.operators_test_1.1_3fca26950c6290e2_.log took 366.581 milliseconds to finish upload 2024-04-03T12:45:51.1736951Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.tensor_iterator_test_1.1_8b242e916db0b27c_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:51.1747293Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.tensor_iterator_test_1.1_8b242e916db0b27c_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:51.4929021Z ##[debug]File: 17/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.scalar_test_1.1_a524f8e38d588a13_.log took 357.490 milliseconds to finish upload 2024-04-03T12:45:51.4937311Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.undefined_tensor_test_1.1_a70bc1eb9d45214c_.log is less than 64k in size. 
Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:51.4947564Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.undefined_tensor_test_1.1_a70bc1eb9d45214c_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:51.5405606Z ##[debug]File: 18/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.tensor_iterator_test_1.1_8b242e916db0b27c_.log took 367.609 milliseconds to finish upload 2024-04-03T12:45:51.5414651Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.wrapdim_test_1.1_ed33bb88894fd28b_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:51.5424342Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.wrapdim_test_1.1_ed33bb88894fd28b_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:51.8763360Z ##[debug]File: 19/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.undefined_tensor_test_1.1_a70bc1eb9d45214c_.log took 383.218 milliseconds to finish upload 2024-04-03T12:45:51.8772887Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/inductor.test_torchinductor_1.2_d8c374349ca31492_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:51.8799148Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/inductor.test_torchinductor_1.2_d8c374349ca31492_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:51.9134680Z ##[debug]File: 20/38. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/cpp.wrapdim_test_1.1_ed33bb88894fd28b_.log took 372.935 milliseconds to finish upload 2024-04-03T12:45:51.9143656Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/inductor.test_torchinductor_2.2_81c7c8f305edca70_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:51.9168837Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/inductor.test_torchinductor_2.2_81c7c8f305edca70_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:52.2798273Z ##[debug]File: 21/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/inductor.test_torchinductor_2.2_81c7c8f305edca70_.log took 366.276 milliseconds to finish upload 2024-04-03T12:45:52.2809481Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_autograd_1.1_64b93bfb7a9554cc_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-Qw5Tt27KpbZl to potentially reduce the upload size 2024-04-03T12:45:52.2880704Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_autograd_1.1_64b93bfb7a9554cc_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:52.2971645Z ##[debug]File: 22/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/inductor.test_torchinductor_1.2_d8c374349ca31492_.log took 421.024 milliseconds to finish upload 2024-04-03T12:45:52.2984881Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_binary_ufuncs_1.1_f1b11807bec3c907_.log is greater than 64k in size. 
Creating a gzip file on-disk /tmp/tmp-3142739-bIseOsLrL4aX to potentially reduce the upload size 2024-04-03T12:45:52.3740547Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_binary_ufuncs_1.1_f1b11807bec3c907_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:52.7179085Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-Qw5Tt27KpbZl 2024-04-03T12:45:52.7185065Z ##[debug]File: 23/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_autograd_1.1_64b93bfb7a9554cc_.log took 438.758 milliseconds to finish upload 2024-04-03T12:45:52.7193349Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_cuda_1.1_be5818e4c77f9677_.log is less than 64k in size. Creating a gzip file in-memory to potentially reduce the upload size 2024-04-03T12:45:52.7217131Z ##[debug]A gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_cuda_1.1_be5818e4c77f9677_.log helped with reducing the size of the original file. The file will be uploaded using gzip. 2024-04-03T12:45:53.0083791Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-bIseOsLrL4aX 2024-04-03T12:45:53.0089798Z ##[debug]File: 24/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_binary_ufuncs_1.1_f1b11807bec3c907_.log took 711.708 milliseconds to finish upload 2024-04-03T12:45:53.0099198Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_nn_1.1_fd4c49d4ed7f525b_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-bZ6AQneXgTVg to potentially reduce the upload size 2024-04-03T12:45:53.0300300Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_nn_1.1_fd4c49d4ed7f525b_.log is smaller than the original file. The file will be uploaded using gzip. 
2024-04-03T12:45:53.0745395Z ##[debug]File: 25/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_cuda_1.1_be5818e4c77f9677_.log took 355.970 milliseconds to finish upload 2024-04-03T12:45:53.0755636Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_1.10_32c20bd2f2b0e279_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-KtgvnWLw39c8 to potentially reduce the upload size 2024-04-03T12:45:53.1002905Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_1.10_32c20bd2f2b0e279_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:53.4838853Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-bZ6AQneXgTVg 2024-04-03T12:45:53.4844045Z ##[debug]File: 26/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_nn_1.1_fd4c49d4ed7f525b_.log took 475.476 milliseconds to finish upload 2024-04-03T12:45:53.4852341Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_10.10_21964ae873e2930d_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-fgYhZ8AVpoPT to potentially reduce the upload size 2024-04-03T12:45:53.5075428Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_10.10_21964ae873e2930d_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:53.6377074Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-KtgvnWLw39c8 2024-04-03T12:45:53.6382409Z ##[debug]File: 27/38. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_1.10_32c20bd2f2b0e279_.log took 563.607 milliseconds to finish upload 2024-04-03T12:45:53.6390450Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_2.10_dbac769edb38711e_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-RpcdOPfWW2uV to potentially reduce the upload size 2024-04-03T12:45:53.6621601Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_2.10_dbac769edb38711e_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:54.0369978Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-fgYhZ8AVpoPT 2024-04-03T12:45:54.0375065Z ##[debug]File: 28/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_10.10_21964ae873e2930d_.log took 553.072 milliseconds to finish upload 2024-04-03T12:45:54.0386579Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_3.10_c151b39a64edb912_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-SJ0PEo5caUw4 to potentially reduce the upload size 2024-04-03T12:45:54.0621372Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_3.10_c151b39a64edb912_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:54.2489814Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-RpcdOPfWW2uV 2024-04-03T12:45:54.2495362Z ##[debug]File: 29/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_2.10_dbac769edb38711e_.log took 611.415 milliseconds to finish upload 2024-04-03T12:45:54.2507739Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_4.10_dec7da1c824451ee_.log is greater than 64k in size. 
Creating a gzip file on-disk /tmp/tmp-3142739-SF1e60MmY3G6 to potentially reduce the upload size 2024-04-03T12:45:54.2751620Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_4.10_dec7da1c824451ee_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:54.5973569Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-SJ0PEo5caUw4 2024-04-03T12:45:54.5979469Z ##[debug]File: 30/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_3.10_c151b39a64edb912_.log took 560.321 milliseconds to finish upload 2024-04-03T12:45:54.5988342Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_5.10_d6135ccdc22f06c9_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-GlM0FHjnkv63 to potentially reduce the upload size 2024-04-03T12:45:54.6225405Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_5.10_d6135ccdc22f06c9_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:54.8141857Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-SF1e60MmY3G6 2024-04-03T12:45:54.8147825Z ##[debug]File: 31/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_4.10_dec7da1c824451ee_.log took 565.190 milliseconds to finish upload 2024-04-03T12:45:54.8156669Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_6.10_4e3eaa86326da093_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-6mi6wVCvXjvB to potentially reduce the upload size 2024-04-03T12:45:54.8420063Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_6.10_4e3eaa86326da093_.log is smaller than the original file. The file will be uploaded using gzip. 
2024-04-03T12:45:55.1430915Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-GlM0FHjnkv63 2024-04-03T12:45:55.1436324Z ##[debug]File: 32/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_5.10_d6135ccdc22f06c9_.log took 545.711 milliseconds to finish upload 2024-04-03T12:45:55.1448381Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_7.10_42fb8f0437415d49_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-M58T06a0T6a7 to potentially reduce the upload size 2024-04-03T12:45:55.1664852Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_7.10_42fb8f0437415d49_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:55.3600690Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-6mi6wVCvXjvB 2024-04-03T12:45:55.3605728Z ##[debug]File: 33/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_6.10_4e3eaa86326da093_.log took 545.678 milliseconds to finish upload 2024-04-03T12:45:55.3613749Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_8.10_abe63000578516da_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-61wqzb2eXIvv to potentially reduce the upload size 2024-04-03T12:45:55.3822616Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_8.10_abe63000578516da_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:55.7151034Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-M58T06a0T6a7 2024-04-03T12:45:55.7158270Z ##[debug]File: 34/38. 
/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_7.10_42fb8f0437415d49_.log took 572.117 milliseconds to finish upload 2024-04-03T12:45:55.7167141Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_9.10_9497e0f5766be958_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-r85hvdgRtuDk to potentially reduce the upload size 2024-04-03T12:45:55.7403498Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_9.10_9497e0f5766be958_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:55.8977668Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-61wqzb2eXIvv 2024-04-03T12:45:55.8982911Z ##[debug]File: 35/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_8.10_abe63000578516da_.log took 537.497 milliseconds to finish upload 2024-04-03T12:45:55.8991244Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_torch_1.1_88a66d12edd14a59_.log is greater than 64k in size. Creating a gzip file on-disk /tmp/tmp-3142739-8yNE1uFrpeVE to potentially reduce the upload size 2024-04-03T12:45:55.9075123Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_torch_1.1_88a66d12edd14a59_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:56.2308374Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-r85hvdgRtuDk 2024-04-03T12:45:56.2314256Z ##[debug]File: 36/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_ops_9.10_9497e0f5766be958_.log took 515.296 milliseconds to finish upload 2024-04-03T12:45:56.2323315Z ##[debug]/home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_unary_ufuncs_1.1_91fe2da0719b659f_.log is greater than 64k in size. 
Creating a gzip file on-disk /tmp/tmp-3142739-Hq0B5CtWL3SR to potentially reduce the upload size 2024-04-03T12:45:56.3320792Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-8yNE1uFrpeVE 2024-04-03T12:45:56.3364223Z ##[debug]File: 37/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_torch_1.1_88a66d12edd14a59_.log took 438.116 milliseconds to finish upload 2024-04-03T12:45:56.3626386Z ##[debug]The gzip file created for /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_unary_ufuncs_1.1_91fe2da0719b659f_.log is smaller than the original file. The file will be uploaded using gzip. 2024-04-03T12:45:57.1773743Z ##[debug]deleting temporary gzip file /tmp/tmp-3142739-Hq0B5CtWL3SR 2024-04-03T12:45:57.1779373Z ##[debug]File: 38/38. /home/pytorchci/actions-runner/_work/pytorch/pytorch/test/test-reports/test_unary_ufuncs_1.1_91fe2da0719b659f_.log took 946.546 milliseconds to finish upload 2024-04-03T12:45:57.1785002Z Total size of all the files uploaded is 1077854 bytes 2024-04-03T12:45:57.1787477Z File upload process has finished. Finalizing the artifact upload 2024-04-03T12:45:57.1793775Z ##[debug]Artifact Url: https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/pipelines/workflows/8536952077/artifacts?api-version=6.0-preview 2024-04-03T12:45:57.1803879Z ##[debug]URL is https://pipelinesghubeus21.actions.githubusercontent.com/mBh68xKhi8LyM7tp3vECvYXNFvuV4gyVGgmYCteuEZP9JH92QN/_apis/pipelines/workflows/8536952077/artifacts?api-version=6.0-preview&artifactName=logs-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip 2024-04-03T12:45:57.4814209Z ##[debug]Artifact logs-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip has been successfully uploaded, total size in bytes: 18750913 2024-04-03T12:45:57.4818394Z Artifact has been finalized. All files have been successfully uploaded! 
2024-04-03T12:45:57.4820722Z 2024-04-03T12:45:57.4822330Z The raw size of all the files that were specified for upload is 18750913 bytes 2024-04-03T12:45:57.4826735Z The size of all the files that were uploaded is 1077854 bytes. This takes into account any gzip compression used to reduce the upload size, time and storage 2024-04-03T12:45:57.4829901Z 2024-04-03T12:45:57.4833660Z Note: The size of downloaded zips can differ significantly from the reported size. For more information see: https://github.com/actions/upload-artifact#zipped-artifact-downloads 2024-04-03T12:45:57.4837326Z 2024-04-03T12:45:57.4839429Z Artifact logs-runattempt1-test-default-1-1-linux.rocm.gpu_23389745826.zip has been successfully uploaded! 2024-04-03T12:45:57.4978441Z ##[debug]Node Action run completed with exit code 0 2024-04-03T12:45:57.4992037Z ##[debug]Finished: run 2024-04-03T12:45:57.5007297Z ##[debug]Finishing: Upload test artifacts 2024-04-03T12:45:57.5037563Z ##[debug]Evaluating condition for step: 'Collect backtraces from coredumps (if any)' 2024-04-03T12:45:57.5043168Z ##[debug]Evaluating: always() 2024-04-03T12:45:57.5045464Z ##[debug]Evaluating always: 2024-04-03T12:45:57.5047755Z ##[debug]=> true 2024-04-03T12:45:57.5050294Z ##[debug]Result: true 2024-04-03T12:45:57.5053427Z ##[debug]Starting: Collect backtraces from coredumps (if any) 2024-04-03T12:45:57.5076829Z ##[debug]Loading inputs 2024-04-03T12:45:57.5078287Z ##[debug]Loading env 2024-04-03T12:45:57.5087051Z ##[group]Run # shellcheck disable=SC2156 2024-04-03T12:45:57.5087868Z # shellcheck disable=SC2156 2024-04-03T12:45:57.5088986Z find . 
-iname "core.[1-9]*" -exec docker exec "${CONTAINER_NAME}" sh -c "gdb python {} -ex 'bt' -ex 'q'" \; 2024-04-03T12:45:57.5122692Z shell: /usr/bin/bash -e {0} 2024-04-03T12:45:57.5123348Z env: 2024-04-03T12:45:57.5123904Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:57.5124720Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:57.5125937Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:45:57.5127003Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:57.5127634Z AWS_REGION: us-east-1 2024-04-03T12:45:57.5128349Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:45:57.5129121Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:45:57.5137275Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:57.5138133Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:57.5139051Z ##[endgroup] 2024-04-03T12:45:57.5167575Z ##[debug]/usr/bin/bash -e /home/pytorchci/actions-runner/_work/_temp/b9fc8ed6-41eb-4475-88bf-89d2290b8e2c.sh 2024-04-03T12:45:57.7283741Z ##[debug]Finishing: Collect backtraces from coredumps (if any) 2024-04-03T12:45:57.7308860Z ##[debug]Evaluating condition for step: 'Store Core dumps on GitHub' 2024-04-03T12:45:57.7313517Z ##[debug]Evaluating: failure() 2024-04-03T12:45:57.7315288Z ##[debug]Evaluating failure: 2024-04-03T12:45:57.7317246Z ##[debug]=> false 2024-04-03T12:45:57.7318941Z ##[debug]Result: false 2024-04-03T12:45:57.7336969Z ##[debug]Evaluating condition for step: 'Teardown ROCm' 2024-04-03T12:45:57.7340502Z ##[debug]Evaluating: success() 2024-04-03T12:45:57.7342274Z ##[debug]Evaluating success: 2024-04-03T12:45:57.7344033Z ##[debug]=> true 2024-04-03T12:45:57.7345704Z ##[debug]Result: true 2024-04-03T12:45:57.7347659Z ##[debug]Starting: Teardown ROCm 2024-04-03T12:45:57.7396951Z Prepare all required actions 2024-04-03T12:45:57.7397481Z Getting action download info 2024-04-03T12:45:57.7417749Z ##[debug]Register post job cleanup for action: 
/./.github/actions/teardown-rocm 2024-04-03T12:45:57.7433058Z ##[debug]Loading inputs 2024-04-03T12:45:57.7433865Z ##[debug]Loading env 2024-04-03T12:45:57.7440493Z ##[group]Run ./.github/actions/teardown-rocm 2024-04-03T12:45:57.7441042Z env: 2024-04-03T12:45:57.7441416Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:57.7441951Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:57.7442972Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:45:57.7443999Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:57.7444503Z AWS_REGION: us-east-1 2024-04-03T12:45:57.7445055Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:45:57.7445693Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:45:57.7453717Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:57.7454427Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:57.7455193Z ##[endgroup] 2024-04-03T12:45:57.7457942Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:45:57.7459149Z ##[debug]Evaluating: always() 2024-04-03T12:45:57.7459859Z ##[debug]Evaluating always: 2024-04-03T12:45:57.7460506Z ##[debug]=> true 2024-04-03T12:45:57.7461147Z ##[debug]Result: true 2024-04-03T12:45:57.7461866Z ##[debug]Starting: run 2024-04-03T12:45:57.7471787Z ##[debug]Loading inputs 2024-04-03T12:45:57.7472874Z ##[debug]Loading env 2024-04-03T12:45:57.7479569Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2024-04-03T12:45:57.7480528Z # ignore expansion of "docker ps -q" since it could be empty 2024-04-03T12:45:57.7481279Z # shellcheck disable=SC2046 2024-04-03T12:45:57.7481987Z docker stop $(docker ps -q) || true 2024-04-03T12:45:57.7482606Z # Prune all stopped containers. 
2024-04-03T12:45:57.7483189Z docker container prune -f 2024-04-03T12:45:57.7515676Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T12:45:57.7516313Z env: 2024-04-03T12:45:57.7516722Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:45:57.7517266Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:45:57.7518214Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:45:57.7519124Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:45:57.7519619Z AWS_REGION: us-east-1 2024-04-03T12:45:57.7520140Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:45:57.7520778Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:45:57.7528819Z AWS_SESSION_TOKEN: *** 2024-04-03T12:45:57.7529527Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:45:57.7530295Z ##[endgroup] 2024-04-03T12:45:57.7557757Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/1b928654-7569-4e06-989a-cceda7f95bc2.sh 2024-04-03T12:45:58.4580593Z 48ab1b864422 2024-04-03T12:46:12.6016535Z Deleted Containers: 2024-04-03T12:46:12.6017443Z 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:46:12.6018103Z 2024-04-03T12:46:12.6018329Z Total reclaimed space: 7.136GB 2024-04-03T12:46:12.6081545Z ##[debug]Finished: run 2024-04-03T12:46:12.6090345Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:46:12.6094772Z ##[debug]Evaluating: always() 2024-04-03T12:46:12.6096993Z ##[debug]Evaluating always: 2024-04-03T12:46:12.6099223Z ##[debug]=> true 2024-04-03T12:46:12.6101639Z ##[debug]Result: true 2024-04-03T12:46:12.6107883Z ##[debug]Starting: run 2024-04-03T12:46:12.6164255Z Prepare all required actions 2024-04-03T12:46:12.6232973Z ##[debug]Loading inputs 2024-04-03T12:46:12.6235325Z ##[debug]Loading env 2024-04-03T12:46:12.6243732Z ##[group]Run ./.github/actions/diskspace-cleanup 2024-04-03T12:46:12.6244456Z with: 2024-04-03T12:46:12.6244984Z 
diskspace-cutoff: 70 2024-04-03T12:46:12.6245556Z env: 2024-04-03T12:46:12.6246064Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:46:12.6246747Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:46:12.6247851Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:46:12.6248912Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:46:12.6249554Z AWS_REGION: us-east-1 2024-04-03T12:46:12.6250375Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:46:12.6251161Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:46:12.6259275Z AWS_SESSION_TOKEN: *** 2024-04-03T12:46:12.6260214Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:46:12.6261156Z ##[endgroup] 2024-04-03T12:46:12.6264295Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:46:12.6265983Z ##[debug]Evaluating: success() 2024-04-03T12:46:12.6266951Z ##[debug]Evaluating success: 2024-04-03T12:46:12.6267849Z ##[debug]=> true 2024-04-03T12:46:12.6268716Z ##[debug]Result: true 2024-04-03T12:46:12.6269648Z ##[debug]Starting: run 2024-04-03T12:46:12.6282341Z ##[debug]Loading inputs 2024-04-03T12:46:12.6284256Z ##[debug]Evaluating: format('diskspace_cutoff={0} 2024-04-03T12:46:12.6285287Z ##[debug]diskspace=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T12:46:12.6286893Z ##[debug]msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T12:46:12.6288416Z ##[debug]if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6289211Z ##[debug] docker system prune -af 2024-04-03T12:46:12.6290174Z ##[debug] diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T12:46:12.6291293Z ##[debug] if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6292714Z ##[debug] echo "Error: Available diskspace is less than $diskspace_cutoff percent. 
Not enough diskspace." 2024-04-03T12:46:12.6293787Z ##[debug] echo "$msg" 2024-04-03T12:46:12.6294374Z ##[debug] exit 1 2024-04-03T12:46:12.6294928Z ##[debug] else 2024-04-03T12:46:12.6295577Z ##[debug] difference=$((diskspace - diskspace_new)) 2024-04-03T12:46:12.6296451Z ##[debug] echo "Diskspace saved: $difference percent" 2024-04-03T12:46:12.6297209Z ##[debug] fi 2024-04-03T12:46:12.6297717Z ##[debug]fi 2024-04-03T12:46:12.6298254Z ##[debug]', inputs.diskspace-cutoff) 2024-04-03T12:46:12.6298936Z ##[debug]Evaluating format: 2024-04-03T12:46:12.6299555Z ##[debug]..Evaluating String: 2024-04-03T12:46:12.6300506Z ##[debug]..=> 'diskspace_cutoff={0} 2024-04-03T12:46:12.6301424Z ##[debug]diskspace=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T12:46:12.6303030Z ##[debug]msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T12:46:12.6304480Z ##[debug]if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6305287Z ##[debug] docker system prune -af 2024-04-03T12:46:12.6306231Z ##[debug] diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T12:46:12.6307343Z ##[debug] if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6308567Z ##[debug] echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-04-03T12:46:12.6309626Z ##[debug] echo "$msg" 2024-04-03T12:46:12.6310225Z ##[debug] exit 1 2024-04-03T12:46:12.6310844Z ##[debug] else 2024-04-03T12:46:12.6311481Z ##[debug] difference=$((diskspace - diskspace_new)) 2024-04-03T12:46:12.6312495Z ##[debug] echo "Diskspace saved: $difference percent" 2024-04-03T12:46:12.6313266Z ##[debug] fi 2024-04-03T12:46:12.6313792Z ##[debug]fi 2024-04-03T12:46:12.6314281Z ##[debug]' 2024-04-03T12:46:12.6314992Z ##[debug]..Evaluating Index: 2024-04-03T12:46:12.6315621Z ##[debug]....Evaluating inputs: 2024-04-03T12:46:12.6316258Z ##[debug]....=> Object 2024-04-03T12:46:12.6316943Z ##[debug]....Evaluating String: 2024-04-03T12:46:12.6317597Z ##[debug]....=> 'diskspace-cutoff' 2024-04-03T12:46:12.6318265Z ##[debug]..=> '70' 2024-04-03T12:46:12.6318948Z ##[debug]=> 'diskspace_cutoff=70 2024-04-03T12:46:12.6319851Z ##[debug]diskspace=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T12:46:12.6321521Z ##[debug]msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T12:46:12.6322973Z ##[debug]if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6323790Z ##[debug] docker system prune -af 2024-04-03T12:46:12.6324833Z ##[debug] diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T12:46:12.6325947Z ##[debug] if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6327224Z ##[debug] echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-04-03T12:46:12.6328293Z ##[debug] echo "$msg" 2024-04-03T12:46:12.6328876Z ##[debug] exit 1 2024-04-03T12:46:12.6329426Z ##[debug] else 2024-04-03T12:46:12.6330072Z ##[debug] difference=$((diskspace - diskspace_new)) 2024-04-03T12:46:12.6330943Z ##[debug] echo "Diskspace saved: $difference percent" 2024-04-03T12:46:12.6331712Z ##[debug] fi 2024-04-03T12:46:12.6332163Z ##[debug]fi 2024-04-03T12:46:12.6332552Z ##[debug]' 2024-04-03T12:46:12.6333396Z ##[debug]Result: 'diskspace_cutoff=70 2024-04-03T12:46:12.6334286Z ##[debug]diskspace=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T12:46:12.6335826Z ##[debug]msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T12:46:12.6337260Z ##[debug]if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6337956Z ##[debug] docker system prune -af 2024-04-03T12:46:12.6338827Z ##[debug] diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed ''s/%//'' | sed ''s/ //'') 2024-04-03T12:46:12.6340034Z ##[debug] if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6341209Z ##[debug] echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-04-03T12:46:12.6342168Z ##[debug] echo "$msg" 2024-04-03T12:46:12.6342663Z ##[debug] exit 1 2024-04-03T12:46:12.6343133Z ##[debug] else 2024-04-03T12:46:12.6343747Z ##[debug] difference=$((diskspace - diskspace_new)) 2024-04-03T12:46:12.6344536Z ##[debug] echo "Diskspace saved: $difference percent" 2024-04-03T12:46:12.6345182Z ##[debug] fi 2024-04-03T12:46:12.6345604Z ##[debug]fi 2024-04-03T12:46:12.6346012Z ##[debug]' 2024-04-03T12:46:12.6346571Z ##[debug]Loading env 2024-04-03T12:46:12.6353948Z ##[group]Run diskspace_cutoff=70 2024-04-03T12:46:12.6354582Z diskspace_cutoff=70 2024-04-03T12:46:12.6355347Z diskspace=$(df -H / --output=pcent | sed -n 2p | sed 's/%//' | sed 's/ //') 2024-04-03T12:46:12.6356852Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-04-03T12:46:12.6358257Z if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6359030Z  docker system prune -af 2024-04-03T12:46:12.6359905Z  diskspace_new=$(df -H / --output=pcent | sed -n 2p | sed 's/%//' | sed 's/ //') 2024-04-03T12:46:12.6360912Z  if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-04-03T12:46:12.6362180Z  echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-04-03T12:46:12.6363211Z  echo "$msg" 2024-04-03T12:46:12.6363707Z  exit 1 2024-04-03T12:46:12.6364179Z  else 2024-04-03T12:46:12.6364754Z  difference=$((diskspace - diskspace_new)) 2024-04-03T12:46:12.6365537Z  echo "Diskspace saved: $difference percent" 2024-04-03T12:46:12.6366312Z  fi 2024-04-03T12:46:12.6366725Z fi 2024-04-03T12:46:12.6399864Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-04-03T12:46:12.6400636Z env: 2024-04-03T12:46:12.6401115Z GIT_DEFAULT_BRANCH: main 2024-04-03T12:46:12.6401677Z DOCKER_HOST: unix:///run/user/1123/docker.sock 2024-04-03T12:46:12.6402681Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-04-03T12:46:12.6403643Z AWS_DEFAULT_REGION: us-east-1 2024-04-03T12:46:12.6404277Z AWS_REGION: us-east-1 2024-04-03T12:46:12.6404987Z AWS_ACCESS_KEY_ID: *** 2024-04-03T12:46:12.6405656Z AWS_SECRET_ACCESS_KEY: *** 2024-04-03T12:46:12.6413645Z AWS_SESSION_TOKEN: *** 2024-04-03T12:46:12.6616121Z CONTAINER_NAME: 48ab1b864422c19c81d27154005b78a378efa0fe4072190c5fd589b476c06176 2024-04-03T12:46:12.6617142Z ##[endgroup] 2024-04-03T12:46:12.6657283Z ##[debug]/usr/bin/bash --noprofile --norc -e -o pipefail /home/pytorchci/actions-runner/_work/_temp/be9750a8-7408-41ab-b6ee-abb83587fafd.sh 2024-04-03T12:46:12.6769054Z ##[debug]Finished: run 2024-04-03T12:46:12.6780105Z ##[debug]Finished: run 2024-04-03T12:46:12.6792687Z ##[debug]Finishing: Teardown ROCm 2024-04-03T12:46:12.6820238Z ##[debug]Evaluating condition for step: 'Post Teardown ROCm' 2024-04-03T12:46:12.6824241Z ##[debug]Evaluating: always() 2024-04-03T12:46:12.6825911Z ##[debug]Evaluating always: 2024-04-03T12:46:12.6827599Z ##[debug]=> true 2024-04-03T12:46:12.6829255Z ##[debug]Result: true 2024-04-03T12:46:12.6831304Z ##[debug]Starting: Post Teardown ROCm 2024-04-03T12:46:12.6884862Z ##[debug]Loading inputs 2024-04-03T12:46:12.6885666Z ##[debug]Loading env 2024-04-03T12:46:12.6892362Z Post job cleanup. 
2024-04-03T12:46:12.6895897Z ##[debug]Finishing: Post Teardown ROCm 2024-04-03T12:46:12.6905944Z ##[debug]Evaluating condition for step: 'Post Login to Amazon ECR' 2024-04-03T12:46:12.6907671Z ##[debug]Evaluating: always() 2024-04-03T12:46:12.6908477Z ##[debug]Evaluating always: 2024-04-03T12:46:12.6909173Z ##[debug]=> true 2024-04-03T12:46:12.6909854Z ##[debug]Result: true 2024-04-03T12:46:12.6910575Z ##[debug]Starting: Post Login to Amazon ECR 2024-04-03T12:46:12.6936675Z ##[debug]Loading inputs 2024-04-03T12:46:12.6941169Z ##[debug]Loading env 2024-04-03T12:46:12.6947379Z Post job cleanup. 2024-04-03T12:46:12.7441104Z Logging out of registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-04-03T12:46:12.8007234Z ##[debug]Node Action run completed with exit code 0 2024-04-03T12:46:12.8013522Z ##[debug]Finishing: Post Login to Amazon ECR 2024-04-03T12:46:12.8035924Z ##[debug]Evaluating condition for step: 'Post configure aws credentials' 2024-04-03T12:46:12.8039160Z ##[debug]Evaluating: always() 2024-04-03T12:46:12.8040541Z ##[debug]Evaluating always: 2024-04-03T12:46:12.8041798Z ##[debug]=> true 2024-04-03T12:46:12.8043087Z ##[debug]Result: true 2024-04-03T12:46:12.8044749Z ##[debug]Starting: Post configure aws credentials 2024-04-03T12:46:12.8132642Z ##[debug]Loading inputs 2024-04-03T12:46:12.8153493Z ##[debug]Loading env 2024-04-03T12:46:12.8164987Z Post job cleanup. 
2024-04-03T12:46:12.9725636Z ##[debug]Node Action run completed with exit code 0 2024-04-03T12:46:12.9728747Z ##[debug]AWS_ACCESS_KEY_ID='' 2024-04-03T12:46:12.9730122Z ##[debug]AWS_SECRET_ACCESS_KEY='' 2024-04-03T12:46:12.9731488Z ##[debug]AWS_SESSION_TOKEN='' 2024-04-03T12:46:12.9732785Z ##[debug]AWS_DEFAULT_REGION='' 2024-04-03T12:46:12.9734051Z ##[debug]AWS_REGION='' 2024-04-03T12:46:12.9740006Z ##[debug]Finishing: Post configure aws credentials 2024-04-03T12:46:12.9764878Z ##[debug]Evaluating condition for step: 'Post Setup ROCm' 2024-04-03T12:46:12.9768547Z ##[debug]Evaluating: always() 2024-04-03T12:46:12.9770270Z ##[debug]Evaluating always: 2024-04-03T12:46:12.9771853Z ##[debug]=> true 2024-04-03T12:46:12.9773574Z ##[debug]Result: true 2024-04-03T12:46:12.9775561Z ##[debug]Starting: Post Setup ROCm 2024-04-03T12:46:12.9973282Z ##[debug]Loading inputs 2024-04-03T12:46:12.9974088Z ##[debug]Loading env 2024-04-03T12:46:12.9980402Z Post job cleanup. 2024-04-03T12:46:12.9983475Z ##[debug]Finishing: Post Setup ROCm 2024-04-03T12:46:12.9992767Z ##[debug]Evaluating condition for step: 'Post Checkout PyTorch' 2024-04-03T12:46:12.9994300Z ##[debug]Evaluating: always() 2024-04-03T12:46:12.9994943Z ##[debug]Evaluating always: 2024-04-03T12:46:12.9995558Z ##[debug]=> true 2024-04-03T12:46:12.9996176Z ##[debug]Result: true 2024-04-03T12:46:12.9996938Z ##[debug]Starting: Post Checkout PyTorch 2024-04-03T12:46:13.0043733Z ##[debug]Loading inputs 2024-04-03T12:46:13.0046694Z ##[debug]Loading env 2024-04-03T12:46:13.0052717Z Post job cleanup. 
2024-04-03T12:46:13.0055106Z ##[debug]Evaluating condition for step: 'run' 2024-04-03T12:46:13.0056259Z ##[debug]Evaluating: always() 2024-04-03T12:46:13.0056872Z ##[debug]Evaluating always: 2024-04-03T12:46:13.0057459Z ##[debug]=> true 2024-04-03T12:46:13.0058044Z ##[debug]Result: true 2024-04-03T12:46:13.0058691Z ##[debug]Starting: run 2024-04-03T12:46:13.0097776Z ##[debug]Loading inputs 2024-04-03T12:46:13.0099552Z ##[debug]Evaluating: (((github.event_name == 'pull_request') && github.event.pull_request.head.sha) || github.sha) 2024-04-03T12:46:13.0100590Z ##[debug]Evaluating Or: 2024-04-03T12:46:13.0101050Z ##[debug]..Evaluating And: 2024-04-03T12:46:13.0101513Z ##[debug]....Evaluating Equal: 2024-04-03T12:46:13.0102004Z ##[debug]......Evaluating Index: 2024-04-03T12:46:13.0102494Z ##[debug]........Evaluating github: 2024-04-03T12:46:13.0103001Z ##[debug]........=> Object 2024-04-03T12:46:13.0103463Z ##[debug]........Evaluating String: 2024-04-03T12:46:13.0103973Z ##[debug]........=> 'event_name' 2024-04-03T12:46:13.0104478Z ##[debug]......=> 'push' 2024-04-03T12:46:13.0104926Z ##[debug]......Evaluating String: 2024-04-03T12:46:13.0105588Z ##[debug]......=> 'pull_request' 2024-04-03T12:46:13.0106065Z ##[debug]....=> false 2024-04-03T12:46:13.0106467Z ##[debug]..=> false 2024-04-03T12:46:13.0106871Z ##[debug]..Evaluating Index: 2024-04-03T12:46:13.0107330Z ##[debug]....Evaluating github: 2024-04-03T12:46:13.0107798Z ##[debug]....=> Object 2024-04-03T12:46:13.0108234Z ##[debug]....Evaluating String: 2024-04-03T12:46:13.0108696Z ##[debug]....=> 'sha' 2024-04-03T12:46:13.0109226Z ##[debug]..=> 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T12:46:13.0109938Z ##[debug]=> 'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T12:46:13.0111125Z ##[debug]Expanded: ((('push' == 'pull_request') && github['event']['pull_request']['head']['sha']) || 'd253bd7befc64137de913dbb8727b511f1052411') 2024-04-03T12:46:13.0112265Z ##[debug]Result: 
'd253bd7befc64137de913dbb8727b511f1052411' 2024-04-03T12:46:13.0113074Z ##[debug]Evaluating: inputs.fetch-depth 2024-04-03T12:46:13.0113604Z ##[debug]Evaluating Index: 2024-04-03T12:46:13.0114065Z ##[debug]..Evaluating inputs: 2024-04-03T12:46:13.0114519Z ##[debug]..=> Object 2024-04-03T12:46:13.0114935Z ##[debug]..Evaluating String: 2024-04-03T12:46:13.0115397Z ##[debug]..=> 'fetch-depth' 2024-04-03T12:46:13.0115847Z ##[debug]=> '0' 2024-04-03T12:46:13.0116240Z ##[debug]Result: '0' 2024-04-03T12:46:13.0116778Z ##[debug]Evaluating: inputs.submodules 2024-04-03T12:46:13.0117308Z ##[debug]Evaluating Index: 2024-04-03T12:46:13.0117759Z ##[debug]..Evaluating inputs: 2024-04-03T12:46:13.0118217Z ##[debug]..=> Object 2024-04-03T12:46:13.0118640Z ##[debug]..Evaluating String: 2024-04-03T12:46:13.0119095Z ##[debug]..=> 'submodules' 2024-04-03T12:46:13.0119538Z ##[debug]=> 'recursive' 2024-04-03T12:46:13.0119971Z ##[debug]Result: 'recursive' 2024-04-03T12:46:13.0120756Z ##[debug]Evaluating: github.repository 2024-04-03T12:46:13.0121282Z ##[debug]Evaluating Index: 2024-04-03T12:46:13.0121733Z ##[debug]..Evaluating github: 2024-04-03T12:46:13.0122185Z ##[debug]..=> Object 2024-04-03T12:46:13.0122606Z ##[debug]..Evaluating String: 2024-04-03T12:46:13.0123069Z ##[debug]..=> 'repository' 2024-04-03T12:46:13.0123531Z ##[debug]=> 'pytorch/pytorch' 2024-04-03T12:46:13.0124017Z ##[debug]Result: 'pytorch/pytorch' 2024-04-03T12:46:13.0125610Z ##[debug]Evaluating: github.token 2024-04-03T12:46:13.0126107Z ##[debug]Evaluating Index: 2024-04-03T12:46:13.0126564Z ##[debug]..Evaluating github: 2024-04-03T12:46:13.0127166Z ##[debug]..=> Object 2024-04-03T12:46:13.0127594Z ##[debug]..Evaluating String: 2024-04-03T12:46:13.0128235Z ##[debug]..=> 'token' 2024-04-03T12:46:13.0128856Z ##[debug]=> '***' 2024-04-03T12:46:13.0129420Z ##[debug]Result: '***' 2024-04-03T12:46:13.0138597Z ##[debug]Loading env 2024-04-03T12:46:13.0146701Z Post job cleanup. 
2024-04-03T12:46:13.1265080Z ##[debug]Getting git version 2024-04-03T12:46:13.1281469Z [command]/usr/bin/git version 2024-04-03T12:46:13.1341893Z git version 2.39.0 2024-04-03T12:46:13.1363432Z ##[debug]0 2024-04-03T12:46:13.1365246Z ##[debug]git version 2.39.0 2024-04-03T12:46:13.1366468Z ##[debug] 2024-04-03T12:46:13.1369087Z ##[debug]Set git useragent to: git/2.39.0 (github-actions-checkout) 2024-04-03T12:46:13.1372532Z ::add-mask::*** 2024-04-03T12:46:13.1394395Z Temporarily overriding HOME='/home/pytorchci/actions-runner/_work/_temp/7a1c5fd3-3559-4839-914a-e213f0488070' before making global git config changes 2024-04-03T12:46:13.1398868Z Adding repository directory to the temporary git global config as a safe directory 2024-04-03T12:46:13.1402841Z [command]/usr/bin/git config --global --add safe.directory /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-04-03T12:46:13.1447360Z ##[debug]0 2024-04-03T12:46:13.1449126Z ##[debug] 2024-04-03T12:46:13.1454110Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2024-04-03T12:46:13.1508728Z ##[debug]1 2024-04-03T12:46:13.1509919Z ##[debug] 2024-04-03T12:46:13.1513989Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2024-04-03T12:46:13.1958668Z Entering 'android/libs/fbjni' 2024-04-03T12:46:13.2040309Z Entering 'third_party/FP16' 2024-04-03T12:46:13.2111041Z Entering 'third_party/FXdiv' 2024-04-03T12:46:13.2181743Z Entering 'third_party/NNPACK' 2024-04-03T12:46:13.2283723Z Entering 'third_party/QNNPACK' 2024-04-03T12:46:13.2365942Z Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T12:46:13.2444056Z Entering 'third_party/XNNPACK' 2024-04-03T12:46:13.2556502Z Entering 'third_party/benchmark' 2024-04-03T12:46:13.2644748Z Entering 'third_party/cpuinfo' 2024-04-03T12:46:13.2745499Z Entering 'third_party/cub' 2024-04-03T12:46:13.2827893Z Entering 
'third_party/cudnn_frontend' 2024-04-03T12:46:13.2924187Z Entering 'third_party/cutlass' 2024-04-03T12:46:13.3023691Z Entering 'third_party/eigen' 2024-04-03T12:46:13.3131056Z Entering 'third_party/fbgemm' 2024-04-03T12:46:13.3195848Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T12:46:13.3268726Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T12:46:13.3358055Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T12:46:13.3441821Z Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T12:46:13.3503791Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T12:46:13.3598064Z Entering 'third_party/flatbuffers' 2024-04-03T12:46:13.3697837Z Entering 'third_party/fmt' 2024-04-03T12:46:13.3782711Z Entering 'third_party/foxi' 2024-04-03T12:46:13.3874355Z Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T12:46:13.3964810Z Entering 'third_party/gloo' 2024-04-03T12:46:13.4060913Z Entering 'third_party/googletest' 2024-04-03T12:46:13.4154495Z Entering 'third_party/ideep' 2024-04-03T12:46:13.4241244Z Entering 'third_party/ideep/mkl-dnn' 2024-04-03T12:46:13.4331640Z Entering 'third_party/ios-cmake' 2024-04-03T12:46:13.4421873Z Entering 'third_party/ittapi' 2024-04-03T12:46:13.4463432Z Entering 'third_party/kineto' 2024-04-03T12:46:13.4540612Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T12:46:13.4613425Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T12:46:13.4678638Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T12:46:13.4735735Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T12:46:13.4793736Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T12:46:13.4872396Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T12:46:13.4968254Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T12:46:13.5047802Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T12:46:13.5109747Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T12:46:13.5184909Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T12:46:13.5255431Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T12:46:13.5340497Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T12:46:13.5432961Z Entering 'third_party/mimalloc' 2024-04-03T12:46:13.5516490Z Entering 'third_party/nccl/nccl' 2024-04-03T12:46:13.5580520Z Entering 'third_party/neon2sse' 2024-04-03T12:46:13.5671342Z Entering 'third_party/nlohmann' 2024-04-03T12:46:13.5753880Z Entering 'third_party/onnx' 2024-04-03T12:46:13.5847145Z Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T12:46:13.5936654Z Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T12:46:13.6032923Z Entering 'third_party/onnx-tensorrt' 2024-04-03T12:46:13.6123877Z Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T12:46:13.6217395Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T12:46:13.6309179Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T12:46:13.6399812Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T12:46:13.6506791Z Entering 'third_party/pocketfft' 2024-04-03T12:46:13.6592657Z Entering 'third_party/protobuf' 2024-04-03T12:46:13.6648127Z Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T12:46:13.6727402Z Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T12:46:13.6823599Z Entering 'third_party/psimd' 2024-04-03T12:46:13.6906761Z Entering 'third_party/pthreadpool' 2024-04-03T12:46:13.6976136Z Entering 'third_party/pybind11' 
2024-04-03T12:46:13.7037274Z Entering 'third_party/python-peachpy' 2024-04-03T12:46:13.7099950Z Entering 'third_party/sleef' 2024-04-03T12:46:13.7158999Z Entering 'third_party/tbb' 2024-04-03T12:46:13.7204839Z Entering 'third_party/tensorpipe' 2024-04-03T12:46:13.7255225Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T12:46:13.7331673Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T12:46:13.7419225Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T12:46:13.7503008Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T12:46:13.7581028Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T12:46:13.7667851Z Entering 'third_party/zstd' 2024-04-03T12:46:13.7782369Z ##[debug]0 2024-04-03T12:46:13.7788273Z ##[debug]Entering 'android/libs/fbjni' 2024-04-03T12:46:13.7789704Z ##[debug]Entering 'third_party/FP16' 2024-04-03T12:46:13.7791145Z ##[debug]Entering 'third_party/FXdiv' 2024-04-03T12:46:13.7792561Z ##[debug]Entering 'third_party/NNPACK' 2024-04-03T12:46:13.7793984Z ##[debug]Entering 'third_party/QNNPACK' 2024-04-03T12:46:13.7795617Z ##[debug]Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T12:46:13.7797276Z ##[debug]Entering 'third_party/XNNPACK' 2024-04-03T12:46:13.7798763Z ##[debug]Entering 'third_party/benchmark' 2024-04-03T12:46:13.7800247Z ##[debug]Entering 'third_party/cpuinfo' 2024-04-03T12:46:13.7801680Z ##[debug]Entering 'third_party/cub' 2024-04-03T12:46:13.7803147Z ##[debug]Entering 'third_party/cudnn_frontend' 2024-04-03T12:46:13.7804673Z ##[debug]Entering 'third_party/cutlass' 2024-04-03T12:46:13.7806107Z ##[debug]Entering 'third_party/eigen' 2024-04-03T12:46:13.7807502Z ##[debug]Entering 'third_party/fbgemm' 2024-04-03T12:46:13.7809144Z ##[debug]Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T12:46:13.7811072Z ##[debug]Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T12:46:13.7813489Z ##[debug]Entering 
'third_party/fbgemm/third_party/cutlass' 2024-04-03T12:46:13.7815501Z ##[debug]Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T12:46:13.7817576Z ##[debug]Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T12:46:13.7819402Z ##[debug]Entering 'third_party/flatbuffers' 2024-04-03T12:46:13.7821027Z ##[debug]Entering 'third_party/fmt' 2024-04-03T12:46:13.7822391Z ##[debug]Entering 'third_party/foxi' 2024-04-03T12:46:13.7824134Z ##[debug]Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T12:46:13.7825685Z ##[debug]Entering 'third_party/gloo' 2024-04-03T12:46:13.7827133Z ##[debug]Entering 'third_party/googletest' 2024-04-03T12:46:13.7828622Z ##[debug]Entering 'third_party/ideep' 2024-04-03T12:46:13.7830094Z ##[debug]Entering 'third_party/ideep/mkl-dnn' 2024-04-03T12:46:13.7831631Z ##[debug]Entering 'third_party/ios-cmake' 2024-04-03T12:46:13.7833088Z ##[debug]Entering 'third_party/ittapi' 2024-04-03T12:46:13.7834515Z ##[debug]Entering 'third_party/kineto' 2024-04-03T12:46:13.7836330Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T12:46:13.7838859Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T12:46:13.7841673Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T12:46:13.7844486Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T12:46:13.7847329Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T12:46:13.7850257Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T12:46:13.7853361Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T12:46:13.7856264Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T12:46:13.7859191Z ##[debug]Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T12:46:13.7862124Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T12:46:13.7864611Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T12:46:13.7866853Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T12:46:13.7868777Z ##[debug]Entering 'third_party/mimalloc' 2024-04-03T12:46:13.7870242Z ##[debug]Entering 'third_party/nccl/nccl' 2024-04-03T12:46:13.7871716Z ##[debug]Entering 'third_party/neon2sse' 2024-04-03T12:46:13.7873165Z ##[debug]Entering 'third_party/nlohmann' 2024-04-03T12:46:13.7874601Z ##[debug]Entering 'third_party/onnx' 2024-04-03T12:46:13.7876229Z ##[debug]Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T12:46:13.7878156Z ##[debug]Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T12:46:13.7879936Z ##[debug]Entering 'third_party/onnx-tensorrt' 2024-04-03T12:46:13.7881763Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T12:46:13.7884190Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T12:46:13.7886958Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T12:46:13.7889902Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T12:46:13.7892239Z ##[debug]Entering 'third_party/pocketfft' 2024-04-03T12:46:13.7893720Z ##[debug]Entering 'third_party/protobuf' 2024-04-03T12:46:13.7895487Z ##[debug]Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T12:46:13.7897602Z ##[debug]Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T12:46:13.7899395Z ##[debug]Entering 'third_party/psimd' 2024-04-03T12:46:13.7901199Z ##[debug]Entering 'third_party/pthreadpool' 2024-04-03T12:46:13.7902721Z ##[debug]Entering 'third_party/pybind11' 2024-04-03T12:46:13.7904263Z 
##[debug]Entering 'third_party/python-peachpy' 2024-04-03T12:46:13.7905792Z ##[debug]Entering 'third_party/sleef' 2024-04-03T12:46:13.7907456Z ##[debug]Entering 'third_party/tbb' 2024-04-03T12:46:13.7908895Z ##[debug]Entering 'third_party/tensorpipe' 2024-04-03T12:46:13.7910527Z ##[debug]Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T12:46:13.7911434Z ##[debug]Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T12:46:13.7912196Z ##[debug]Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T12:46:13.7912977Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T12:46:13.7913898Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T12:46:13.7914646Z ##[debug]Entering 'third_party/zstd' 2024-04-03T12:46:13.7915133Z ##[debug] 2024-04-03T12:46:13.7916056Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2024-04-03T12:46:13.7917022Z http.https://github.com/.extraheader 2024-04-03T12:46:13.7917811Z ##[debug]0 2024-04-03T12:46:13.7918521Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:13.7919083Z ##[debug] 2024-04-03T12:46:13.7919854Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2024-04-03T12:46:13.7925943Z ##[debug]0 2024-04-03T12:46:13.7926586Z ##[debug] 2024-04-03T12:46:13.7933579Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2024-04-03T12:46:13.8368417Z Entering 'android/libs/fbjni' 2024-04-03T12:46:13.8420686Z http.https://github.com/.extraheader 2024-04-03T12:46:13.8488799Z Entering 'third_party/FP16' 2024-04-03T12:46:13.8540345Z http.https://github.com/.extraheader 2024-04-03T12:46:13.8606747Z Entering 'third_party/FXdiv' 2024-04-03T12:46:13.8658350Z 
http.https://github.com/.extraheader 2024-04-03T12:46:13.8735063Z Entering 'third_party/NNPACK' 2024-04-03T12:46:13.8784008Z http.https://github.com/.extraheader 2024-04-03T12:46:13.8851842Z Entering 'third_party/QNNPACK' 2024-04-03T12:46:13.8901111Z http.https://github.com/.extraheader 2024-04-03T12:46:13.8963776Z Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T12:46:13.9010804Z http.https://github.com/.extraheader 2024-04-03T12:46:13.9082061Z Entering 'third_party/XNNPACK' 2024-04-03T12:46:13.9125373Z http.https://github.com/.extraheader 2024-04-03T12:46:13.9212902Z Entering 'third_party/benchmark' 2024-04-03T12:46:13.9262458Z http.https://github.com/.extraheader 2024-04-03T12:46:13.9333508Z Entering 'third_party/cpuinfo' 2024-04-03T12:46:13.9386002Z http.https://github.com/.extraheader 2024-04-03T12:46:13.9456980Z Entering 'third_party/cub' 2024-04-03T12:46:13.9504698Z http.https://github.com/.extraheader 2024-04-03T12:46:13.9568204Z Entering 'third_party/cudnn_frontend' 2024-04-03T12:46:13.9615781Z http.https://github.com/.extraheader 2024-04-03T12:46:13.9686514Z Entering 'third_party/cutlass' 2024-04-03T12:46:13.9736675Z http.https://github.com/.extraheader 2024-04-03T12:46:13.9806902Z Entering 'third_party/eigen' 2024-04-03T12:46:13.9845854Z http.https://github.com/.extraheader 2024-04-03T12:46:13.9909591Z Entering 'third_party/fbgemm' 2024-04-03T12:46:13.9953304Z http.https://github.com/.extraheader 2024-04-03T12:46:14.0012979Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T12:46:14.0061144Z http.https://github.com/.extraheader 2024-04-03T12:46:14.0126382Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T12:46:14.0169302Z http.https://github.com/.extraheader 2024-04-03T12:46:14.0228441Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T12:46:14.0264442Z http.https://github.com/.extraheader 2024-04-03T12:46:14.0333397Z Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T12:46:14.0384185Z 
http.https://github.com/.extraheader 2024-04-03T12:46:14.0444976Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T12:46:14.0491646Z http.https://github.com/.extraheader 2024-04-03T12:46:14.0554669Z Entering 'third_party/flatbuffers' 2024-04-03T12:46:14.0595304Z http.https://github.com/.extraheader 2024-04-03T12:46:14.0663976Z Entering 'third_party/fmt' 2024-04-03T12:46:14.0708269Z http.https://github.com/.extraheader 2024-04-03T12:46:14.0778601Z Entering 'third_party/foxi' 2024-04-03T12:46:14.0827560Z http.https://github.com/.extraheader 2024-04-03T12:46:14.0896298Z Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T12:46:14.0945752Z http.https://github.com/.extraheader 2024-04-03T12:46:14.1001036Z Entering 'third_party/gloo' 2024-04-03T12:46:14.1040274Z http.https://github.com/.extraheader 2024-04-03T12:46:14.1100308Z Entering 'third_party/googletest' 2024-04-03T12:46:14.1144171Z http.https://github.com/.extraheader 2024-04-03T12:46:14.1212366Z Entering 'third_party/ideep' 2024-04-03T12:46:14.1262312Z http.https://github.com/.extraheader 2024-04-03T12:46:14.1328456Z Entering 'third_party/ideep/mkl-dnn' 2024-04-03T12:46:14.1377646Z http.https://github.com/.extraheader 2024-04-03T12:46:14.1456680Z Entering 'third_party/ios-cmake' 2024-04-03T12:46:14.1506568Z http.https://github.com/.extraheader 2024-04-03T12:46:14.1567777Z Entering 'third_party/ittapi' 2024-04-03T12:46:14.1617956Z http.https://github.com/.extraheader 2024-04-03T12:46:14.1688517Z Entering 'third_party/kineto' 2024-04-03T12:46:14.1730784Z http.https://github.com/.extraheader 2024-04-03T12:46:14.1791376Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T12:46:14.1841355Z http.https://github.com/.extraheader 2024-04-03T12:46:14.1909793Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T12:46:14.1957035Z http.https://github.com/.extraheader 2024-04-03T12:46:14.2024852Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T12:46:14.2073906Z http.https://github.com/.extraheader 2024-04-03T12:46:14.2139058Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T12:46:14.2178201Z http.https://github.com/.extraheader 2024-04-03T12:46:14.2238768Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T12:46:14.2285339Z http.https://github.com/.extraheader 2024-04-03T12:46:14.2343700Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T12:46:14.2391472Z http.https://github.com/.extraheader 2024-04-03T12:46:14.2472313Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T12:46:14.2520370Z http.https://github.com/.extraheader 2024-04-03T12:46:14.2575016Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T12:46:14.2623042Z http.https://github.com/.extraheader 2024-04-03T12:46:14.2691164Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T12:46:14.2736371Z http.https://github.com/.extraheader 2024-04-03T12:46:14.2791634Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T12:46:14.2840955Z http.https://github.com/.extraheader 2024-04-03T12:46:14.2905695Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T12:46:14.2954158Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3011793Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T12:46:14.3050214Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3106574Z Entering 'third_party/mimalloc' 2024-04-03T12:46:14.3157595Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3213708Z Entering 'third_party/nccl/nccl' 2024-04-03T12:46:14.3264293Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3331610Z Entering 'third_party/neon2sse' 
2024-04-03T12:46:14.3378031Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3435539Z Entering 'third_party/nlohmann' 2024-04-03T12:46:14.3464296Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3524042Z Entering 'third_party/onnx' 2024-04-03T12:46:14.3557793Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3616088Z Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T12:46:14.3653316Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3701660Z Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T12:46:14.3742244Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3810056Z Entering 'third_party/onnx-tensorrt' 2024-04-03T12:46:14.3848618Z http.https://github.com/.extraheader 2024-04-03T12:46:14.3919139Z Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T12:46:14.3968272Z http.https://github.com/.extraheader 2024-04-03T12:46:14.4054437Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 2024-04-03T12:46:14.4100685Z http.https://github.com/.extraheader 2024-04-03T12:46:14.4170106Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T12:46:14.4211652Z http.https://github.com/.extraheader 2024-04-03T12:46:14.4265003Z Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T12:46:14.4311259Z http.https://github.com/.extraheader 2024-04-03T12:46:14.4390478Z Entering 'third_party/pocketfft' 2024-04-03T12:46:14.4438643Z http.https://github.com/.extraheader 2024-04-03T12:46:14.4499270Z Entering 'third_party/protobuf' 2024-04-03T12:46:14.4538809Z http.https://github.com/.extraheader 2024-04-03T12:46:14.4603744Z Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T12:46:14.4653842Z http.https://github.com/.extraheader 2024-04-03T12:46:14.4714988Z Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T12:46:14.4763569Z http.https://github.com/.extraheader 2024-04-03T12:46:14.4833623Z 
Entering 'third_party/psimd' 2024-04-03T12:46:14.4884530Z http.https://github.com/.extraheader 2024-04-03T12:46:14.4952086Z Entering 'third_party/pthreadpool' 2024-04-03T12:46:14.5003165Z http.https://github.com/.extraheader 2024-04-03T12:46:14.5062088Z Entering 'third_party/pybind11' 2024-04-03T12:46:14.5112405Z http.https://github.com/.extraheader 2024-04-03T12:46:14.5183420Z Entering 'third_party/python-peachpy' 2024-04-03T12:46:14.5225629Z http.https://github.com/.extraheader 2024-04-03T12:46:14.5287091Z Entering 'third_party/sleef' 2024-04-03T12:46:14.5339586Z http.https://github.com/.extraheader 2024-04-03T12:46:14.5409190Z Entering 'third_party/tbb' 2024-04-03T12:46:14.5458104Z http.https://github.com/.extraheader 2024-04-03T12:46:14.5525073Z Entering 'third_party/tensorpipe' 2024-04-03T12:46:14.5578086Z http.https://github.com/.extraheader 2024-04-03T12:46:14.5649850Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T12:46:14.5699415Z http.https://github.com/.extraheader 2024-04-03T12:46:14.5766718Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T12:46:14.5814302Z http.https://github.com/.extraheader 2024-04-03T12:46:14.5880324Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T12:46:14.5916843Z http.https://github.com/.extraheader 2024-04-03T12:46:14.5983815Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T12:46:14.6027074Z http.https://github.com/.extraheader 2024-04-03T12:46:14.6087374Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T12:46:14.6124841Z http.https://github.com/.extraheader 2024-04-03T12:46:14.6195864Z Entering 'third_party/zstd' 2024-04-03T12:46:14.6245126Z http.https://github.com/.extraheader 2024-04-03T12:46:14.6342451Z ##[debug]0 2024-04-03T12:46:14.6351196Z ##[debug]Entering 'android/libs/fbjni' 2024-04-03T12:46:14.6352772Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6354351Z ##[debug]Entering 'third_party/FP16' 
2024-04-03T12:46:14.6355938Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6357502Z ##[debug]Entering 'third_party/FXdiv' 2024-04-03T12:46:14.6359038Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6360618Z ##[debug]Entering 'third_party/NNPACK' 2024-04-03T12:46:14.6362166Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6363758Z ##[debug]Entering 'third_party/QNNPACK' 2024-04-03T12:46:14.6365328Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6367693Z ##[debug]Entering 'third_party/VulkanMemoryAllocator' 2024-04-03T12:46:14.6369502Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6371101Z ##[debug]Entering 'third_party/XNNPACK' 2024-04-03T12:46:14.6372663Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6374277Z ##[debug]Entering 'third_party/benchmark' 2024-04-03T12:46:14.6375849Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6377422Z ##[debug]Entering 'third_party/cpuinfo' 2024-04-03T12:46:14.6379183Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6380923Z ##[debug]Entering 'third_party/cub' 2024-04-03T12:46:14.6382453Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6384109Z ##[debug]Entering 'third_party/cudnn_frontend' 2024-04-03T12:46:14.6385766Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6387331Z ##[debug]Entering 'third_party/cutlass' 2024-04-03T12:46:14.6388906Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6390488Z ##[debug]Entering 'third_party/eigen' 2024-04-03T12:46:14.6392003Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6393564Z ##[debug]Entering 'third_party/fbgemm' 2024-04-03T12:46:14.6395101Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6396899Z ##[debug]Entering 'third_party/fbgemm/third_party/asmjit' 2024-04-03T12:46:14.6398802Z ##[debug]http.https://github.com/.extraheader 
2024-04-03T12:46:14.6400739Z ##[debug]Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-04-03T12:46:14.6402667Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6404573Z ##[debug]Entering 'third_party/fbgemm/third_party/cutlass' 2024-04-03T12:46:14.6406699Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6408676Z ##[debug]Entering 'third_party/fbgemm/third_party/googletest' 2024-04-03T12:46:14.6410656Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6412651Z ##[debug]Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-04-03T12:46:14.6414666Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6416363Z ##[debug]Entering 'third_party/flatbuffers' 2024-04-03T12:46:14.6418072Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6419773Z ##[debug]Entering 'third_party/fmt' 2024-04-03T12:46:14.6421362Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6422987Z ##[debug]Entering 'third_party/foxi' 2024-04-03T12:46:14.6424587Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6426369Z ##[debug]Entering 'third_party/gemmlowp/gemmlowp' 2024-04-03T12:46:14.6428148Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6429792Z ##[debug]Entering 'third_party/gloo' 2024-04-03T12:46:14.6431393Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6433125Z ##[debug]Entering 'third_party/googletest' 2024-04-03T12:46:14.6434846Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6436566Z ##[debug]Entering 'third_party/ideep' 2024-04-03T12:46:14.6438219Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6440006Z ##[debug]Entering 'third_party/ideep/mkl-dnn' 2024-04-03T12:46:14.6441817Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6443579Z ##[debug]Entering 'third_party/ios-cmake' 2024-04-03T12:46:14.6445352Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6447022Z 
##[debug]Entering 'third_party/ittapi' 2024-04-03T12:46:14.6448723Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6450451Z ##[debug]Entering 'third_party/kineto' 2024-04-03T12:46:14.6452202Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6454348Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-04-03T12:46:14.6456486Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6458870Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-04-03T12:46:14.6461704Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6464383Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-04-03T12:46:14.6466819Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6470683Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-04-03T12:46:14.6473646Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6476080Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-04-03T12:46:14.6478039Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6478914Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-04-03T12:46:14.6479817Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6480654Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-04-03T12:46:14.6481512Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6482404Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-04-03T12:46:14.6483296Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6484133Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-04-03T12:46:14.6484981Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6485822Z 
##[debug]Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-04-03T12:46:14.6486668Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6487388Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-04-03T12:46:14.6488109Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6488940Z ##[debug]Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-04-03T12:46:14.6489700Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6490291Z ##[debug]Entering 'third_party/mimalloc' 2024-04-03T12:46:14.6490881Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6491476Z ##[debug]Entering 'third_party/nccl/nccl' 2024-04-03T12:46:14.6492067Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6492659Z ##[debug]Entering 'third_party/neon2sse' 2024-04-03T12:46:14.6493247Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6493839Z ##[debug]Entering 'third_party/nlohmann' 2024-04-03T12:46:14.6494434Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6495016Z ##[debug]Entering 'third_party/onnx' 2024-04-03T12:46:14.6495586Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6496263Z ##[debug]Entering 'third_party/onnx/third_party/benchmark' 2024-04-03T12:46:14.6496948Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6497619Z ##[debug]Entering 'third_party/onnx/third_party/pybind11' 2024-04-03T12:46:14.6498294Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6498907Z ##[debug]Entering 'third_party/onnx-tensorrt' 2024-04-03T12:46:14.6499522Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6500304Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx' 2024-04-03T12:46:14.6501149Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6502145Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/benchmark' 
2024-04-03T12:46:14.6503154Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6504148Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11' 2024-04-03T12:46:14.6505152Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6506227Z ##[debug]Entering 'third_party/onnx-tensorrt/third_party/onnx/third_party/pybind11/tools/clang' 2024-04-03T12:46:14.6507322Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6508052Z ##[debug]Entering 'third_party/pocketfft' 2024-04-03T12:46:14.6508768Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6509479Z ##[debug]Entering 'third_party/protobuf' 2024-04-03T12:46:14.6510189Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6511199Z ##[debug]Entering 'third_party/protobuf/third_party/benchmark' 2024-04-03T12:46:14.6512062Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6512908Z ##[debug]Entering 'third_party/protobuf/third_party/googletest' 2024-04-03T12:46:14.6513757Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6514468Z ##[debug]Entering 'third_party/psimd' 2024-04-03T12:46:14.6515155Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6515955Z ##[debug]Entering 'third_party/pthreadpool' 2024-04-03T12:46:14.6516688Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6517406Z ##[debug]Entering 'third_party/pybind11' 2024-04-03T12:46:14.6518025Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6518651Z ##[debug]Entering 'third_party/python-peachpy' 2024-04-03T12:46:14.6519269Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6519849Z ##[debug]Entering 'third_party/sleef' 2024-04-03T12:46:14.6520426Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6521009Z ##[debug]Entering 'third_party/tbb' 2024-04-03T12:46:14.6521570Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6522168Z 
##[debug]Entering 'third_party/tensorpipe' 2024-04-03T12:46:14.6522762Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6523475Z ##[debug]Entering 'third_party/tensorpipe/third_party/googletest' 2024-04-03T12:46:14.6524201Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6524897Z ##[debug]Entering 'third_party/tensorpipe/third_party/libnop' 2024-04-03T12:46:14.6525592Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6526350Z ##[debug]Entering 'third_party/tensorpipe/third_party/libuv' 2024-04-03T12:46:14.6527055Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6527761Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11' 2024-04-03T12:46:14.6528474Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6529252Z ##[debug]Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-04-03T12:46:14.6530041Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6530622Z ##[debug]Entering 'third_party/zstd' 2024-04-03T12:46:14.6531188Z ##[debug]http.https://github.com/.extraheader 2024-04-03T12:46:14.6531745Z ##[debug] 2024-04-03T12:46:14.6532509Z ##[debug]Unsetting HOME override 2024-04-03T12:46:14.6533804Z ##[debug]Node Action run completed with exit code 0 2024-04-03T12:46:14.6536721Z ##[debug]Finished: run 2024-04-03T12:46:14.6541738Z ##[debug]Finishing: Post Checkout PyTorch 2024-04-03T12:46:14.6742525Z ##[debug]Starting: Complete job 2024-04-03T12:46:14.6745590Z Uploading runner diagnostic logs 2024-04-03T12:46:14.6812354Z ##[debug]Starting diagnostic file upload. 2024-04-03T12:46:14.6812985Z ##[debug]Setting up diagnostic log folders. 2024-04-03T12:46:14.6816192Z ##[debug]Creating diagnostic log files folder. 2024-04-03T12:46:14.6853666Z ##[debug]Copying 1 worker diagnostic logs. 2024-04-03T12:46:14.6895298Z ##[debug]Copying 1 runner diagnostic logs. 2024-04-03T12:46:14.6940210Z ##[debug]Zipping diagnostic files. 
2024-04-03T12:46:14.7863085Z ##[debug]Uploading diagnostic metadata file. 2024-04-03T12:46:14.7894640Z ##[debug]Diagnostic file upload complete. 2024-04-03T12:46:14.7895591Z Completed runner diagnostic log upload 2024-04-03T12:46:14.7896153Z Cleaning up orphan processes 2024-04-03T12:46:15.0400466Z ##[debug]Finishing: Complete job 2024-04-03T12:46:15.0553279Z ##[debug]Finishing: linux-focal-rocm6.0-py3.8 / test (default, 1, 1, linux.rocm.gpu)